PREFACE.


The usual point of view in the study of mechanics is that
where the attention is mainly directed to the changes which
take place in the course of time in a given system. The principal
problem is the determination of the condition of the
system with respect to configuration and velocities at any
required time, when its condition in these respects has been
given for some one time, and the fundamental equations are
those which express the changes continually taking place in
the system. Inquiries of this kind are often simplified by
taking into consideration conditions of the system other than
those through which it actually passes or is supposed to pass,
but our attention is not usually carried beyond conditions
differing infinitesimally from those which are regarded as
actual.

For some purposes, however, it is desirable to take a broader
view of the subject. We may imagine a great number of
systems of the same nature, but differing in the configurations
and velocities which they have at a given instant, and
differing not merely infinitesimally, but it may be so as to
embrace every conceivable combination of configuration and
velocities. And here we may set the problem, not to follow
a particular system through its succession of configurations,
but to determine how the whole number of systems will be
distributed among the various conceivable configurations and
velocities at any required time, when the distribution has
been given for some one time. The fundamental equation
for this inquiry is that which gives the rate of change of the
number of systems which fall within any infinitesimal limits
of configuration and velocity.

phenomena distinctively thermodynamic, we do not escape
difficulties in as simple a matter as the number of degrees
of freedom of a diatomic gas. It is well known that while
theory would assign to the gas six degrees of freedom per
molecule, in our experiments on specific heat we cannot account
for more than five. Certainly, one is building on an
insecure foundation, who rests his work on hypotheses concerning
the constitution of matter.

Difficulties of this kind have deterred the author from attempting
to explain the mysteries of nature, and have forced
him to be contented with the more modest aim of deducing
some of the more obvious propositions relating to the statistical
branch of mechanics. Here, there can be no mistake in
regard to the agreement of the hypotheses with the facts of
nature, for nothing is assumed in that respect. The only
error into which one can fall, is the want of agreement between
the premises and the conclusions, and this, with care,
one may hope, in the main, to avoid.

The matter of the present volume consists in large measure 
of results which have been obtained by the investigators 
mentioned above, although the point of view and the arrangement
may be different. These results, given to the public
one by one in the order of their discovery, have necessarily, 
in their original presentation, not been arranged in the most 
logical manner. 

In the first chapter we consider the general problem which
has been mentioned, and find what may be called the fundamental
equation of statistical mechanics. A particular case
of this equation will give the condition of statistical equilibrium,
i. e., the condition which the distribution of the
systems in phase must satisfy in order that the distribution
shall be permanent. In the general case, the fundamental
equation admits an integration, which gives a principle which
may be variously expressed, according to the point of view
from which it is regarded, as the conservation of density-in-phase,
or of extension-in-phase, or of probability of phase.

In the second chapter, we apply this principle of conservation
of probability of phase to the theory of errors in the
calculated phases of a system, when the determination of the
arbitrary constants of the integral equations is subject to
error. In this application, we do not go beyond the usual
approximations. In other words, we combine the principle
of conservation of probability of phase, which is exact, with
those approximate relations, which it is customary to assume
in the “theory of errors.”

In the third chapter we apply the principle of conservation
of extension-in-phase to the integration of the differential
equations of motion. This gives Jacobi's “last multiplier,”
as has been shown by Boltzmann.

In the fourth and following chapters we return to the consideration
of statistical equilibrium, and confine our attention
to conservative systems. We consider especially ensembles
of systems in which the index (or logarithm) of probability of
phase is a linear function of the energy. This distribution,
on account of its unique importance in the theory of statistical
equilibrium, I have ventured to call canonical, and the
divisor of the energy, the modulus of distribution. The
moduli of ensembles have properties analogous to temperature,
in that equality of the moduli is a condition of equilibrium
with respect to exchange of energy, when such exchange is
made possible.

We find a differential equation relating to average values
in the ensemble which is identical in form with the fundamental
differential equation of thermodynamics, the average
index of probability of phase, with change of sign, corresponding
to entropy, and the modulus to temperature.

For the average square of the anomalies of the energy, we
find an expression which vanishes in comparison with the
square of the average energy, when the number of degrees
of freedom is indefinitely increased. An ensemble of systems
in which the number of degrees of freedom is of the same
order of magnitude as the number of molecules in the bodies
with which we experiment, if distributed canonically, would
therefore appear to human observation as an ensemble of
systems in which all have the same energy.

We meet with other quantities, in the development of the
subject, which, when the number of degrees of freedom is
very great, coincide sensibly with the modulus, and with the
average index of probability, taken negatively, in a canonical
ensemble, and which, therefore, may also be regarded as corresponding
to temperature and entropy. The correspondence
is however imperfect, when the number of degrees of freedom
is not very great, and there is nothing to recommend these
quantities except that in definition they may be regarded as
more simple than those which have been mentioned. In
Chapter XIV, this subject of thermodynamic analogies is
discussed somewhat at length.

Finally, in Chapter XV, we consider the modification of
the preceding results which is necessary when we consider
systems composed of a number of entirely similar particles,
or, it may be, of a number of particles of several kinds, all of
each kind being entirely similar to each other, and when one
of the variations to be considered is that of the numbers of
the particles of the various kinds which are contained in a
system. This supposition would naturally have been introduced
earlier, if our object had been simply the expression of
the laws of nature. It seemed desirable, however, to separate
sharply the purely thermodynamic laws from those special
modifications which belong rather to the theory of the properties
of matter.


J. W. G. 


New Haven, December, 1901.


ELEMENTARY PRINCIPLES IN
STATISTICAL MECHANICS


CHAPTER I. 

GENERAL NOTIONS. THE PRINCIPLE OF CONSERVATION 
OF EXTENSION-IN-PHASE. 

We shall use Hamilton’s form of the equations of motion for
a system of $n$ degrees of freedom, writing $q_1, \dots, q_n$ for the
(generalized) coordinates, $\dot q_1, \dots, \dot q_n$ for the (generalized)
velocities, and

$$F_1\,dq_1 + F_2\,dq_2 + \dots + F_n\,dq_n \tag{1}$$

for the moment of the forces. We shall call the quantities
$F_1, \dots, F_n$ the (generalized) forces, and the quantities $p_1, \dots, p_n$,
defined by the equations

$$p_1 = \frac{d\epsilon_p}{d\dot q_1}, \quad p_2 = \frac{d\epsilon_p}{d\dot q_2}, \quad \text{etc.,} \tag{2}$$

where $\epsilon_p$ denotes the kinetic energy of the system, the (generalized)
momenta. The kinetic energy is here regarded as
a function of the velocities and coordinates. We shall usually
regard it as a function of the momenta and coordinates,*
and on this account we denote it by $\epsilon_p$. This will not prevent
us from occasionally using formulae like (2), where it is
sufficiently evident the kinetic energy is regarded as function
of the $\dot q$'s and $q$'s. But in expressions like $d\epsilon_p/dq_1$, where the
denominator does not determine the question, the kinetic
energy is always to be treated in the differentiation as function
of the $p$'s and $q$'s.

* The use of the momenta instead of the velocities as independent variables
is the characteristic of Hamilton's method which gives his equations of motion
their remarkable degree of simplicity. We shall find that the fundamental
notions of statistical mechanics are most easily defined, and are expressed in
the most simple form, when the momenta with the coordinates are used to
describe the state of a system.
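
As an illustration that is not in the original text, take a single particle of mass $m$ moving in a plane and described by polar coordinates $q_1 = r$, $q_2 = \theta$ (the symbols $m$, $r$, $\theta$ are assumptions of this example). Definitions of the form (2) then give

$$\epsilon_p = \tfrac{1}{2} m \left(\dot r^2 + r^2 \dot\theta^2\right), \qquad p_1 = \frac{d\epsilon_p}{d\dot r} = m \dot r, \qquad p_2 = \frac{d\epsilon_p}{d\dot\theta} = m r^2 \dot\theta,$$

so that, as a function of the momenta and coordinates,

$$\epsilon_p = \frac{p_1^2}{2m} + \frac{p_2^2}{2 m r^2}, \qquad \frac{d\epsilon_p}{dq_1} = -\frac{p_2^2}{m r^3}.$$

Had the velocities been held constant instead, the same symbol would have given $+ m r \dot\theta^2$, which is equal in magnitude and opposite in sign. The example suggests why the choice of independent variables has to be fixed by convention.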

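We have then

$$\dot q_1 = \frac{d\epsilon_p}{dp_1}, \quad \dot p_1 = -\frac{d\epsilon_p}{dq_1} + F_1, \quad \text{etc.} \tag{3}$$

These equations will hold for any forces whatever. If the
forces are conservative, in other words, if the expression (1)
is an exact differential, we may set

$$F_1 = -\frac{d\epsilon_q}{dq_1}, \quad F_2 = -\frac{d\epsilon_q}{dq_2}, \quad \text{etc.,} \tag{4}$$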


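where $\epsilon_q$ is a function of the coordinates which we shall call
the potential energy of the system. If we write $\epsilon$ for the
total energy, we shall have

$$\epsilon = \epsilon_p + \epsilon_q, \tag{5}$$

and equations (3) may be written

$$\dot q_1 = \frac{d\epsilon}{dp_1}, \quad \dot p_1 = -\frac{d\epsilon}{dq_1}, \quad \text{etc.} \tag{6}$$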
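
The following is a minimal numerical sketch, not part of the original text, of how the canonical equations for a conservative system determine the motion once the energy is given as a function of $p$ and $q$. It assumes a one-dimensional oscillator with energy $p^2/2m + kq^2/2$; the names `energy`, `hamilton_rhs`, `rk4_step`, `evolve` and the parameters `m`, `k` are hypothetical, and the Runge-Kutta scheme is only a convenient choice of integrator.

```python
import math


def energy(p, q, m=1.0, k=1.0):
    """Total energy (kinetic plus potential) of the assumed oscillator."""
    return p * p / (2.0 * m) + 0.5 * k * q * q


def hamilton_rhs(p, q, m=1.0, k=1.0):
    """Canonical equations: q_dot = d(energy)/dp, p_dot = -d(energy)/dq."""
    q_dot = p / m
    p_dot = -k * q
    return p_dot, q_dot


def rk4_step(p, q, dt):
    """One classical fourth-order Runge-Kutta step for the pair (p, q)."""
    k1p, k1q = hamilton_rhs(p, q)
    k2p, k2q = hamilton_rhs(p + 0.5 * dt * k1p, q + 0.5 * dt * k1q)
    k3p, k3q = hamilton_rhs(p + 0.5 * dt * k2p, q + 0.5 * dt * k2q)
    k4p, k4q = hamilton_rhs(p + dt * k3p, q + dt * k3q)
    p_new = p + dt * (k1p + 2 * k2p + 2 * k3p + k4p) / 6.0
    q_new = q + dt * (k1q + 2 * k2q + 2 * k3q + k4q) / 6.0
    return p_new, q_new


def evolve(p, q, t_total, dt=1e-3):
    """Carry the phase (p, q) forward through the time t_total."""
    for _ in range(int(round(t_total / dt))):
        p, q = rk4_step(p, q, dt)
    return p, q


# Phase at t = 2 for a system started at p = 0, q = 1 (with m = k = 1).
p_end, q_end = evolve(0.0, 1.0, 2.0)
# For this special case the exact motion is q = cos t, p = -sin t.
exact_p, exact_q = -math.sin(2.0), math.cos(2.0)
```

Comparing `p_end`, `q_end` with `exact_p`, `exact_q`, and `energy(p_end, q_end)` with `energy(0.0, 1.0)`, gives a quick check that the phase at a later time follows from the phase at an earlier one and that the total energy is preserved to within the integration error.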


The potential energy ($\epsilon_q$) may depend on other variables
beside the coordinates $q_1, \dots, q_n$. We shall often suppose it to
depend in part on coordinates of external bodies, which we
shall denote by $a_1$, $a_2$, etc. We shall then have for the complete
value of the differential of the potential energy *

$$d\epsilon_q = -F_1\,dq_1 - \dots - F_n\,dq_n - A_1\,da_1 - A_2\,da_2 - \text{etc.,} \tag{7}$$

where $A_1$, $A_2$, etc., represent forces (in the generalized sense)
exerted by the system on external bodies. For the total energy
($\epsilon$) we shall have

$$d\epsilon = \dot q_1\,dp_1 + \dots + \dot q_n\,dp_n - \dot p_1\,dq_1 - \dots - \dot p_n\,dq_n - A_1\,da_1 - A_2\,da_2 - \text{etc.} \tag{8}$$

It will be observed that the kinetic energy ($\epsilon_p$) in the
most general case is a quadratic function of the $p$'s (or $\dot q$'s)
involving also the $q$'s but not the $a$'s; that the potential energy,
when it exists, is function of the $q$'s and $a$'s; and that the
total energy, when it exists, is function of the $p$'s (or $\dot q$'s), the
$q$'s, and the $a$'s. In expressions like $d\epsilon/dq_1$, the $p$'s, and not
the $\dot q$'s, are to be taken as independent variables, as has already
been stated with respect to the kinetic energy.

* It will be observed, that although we call $\epsilon_q$ the potential energy of the
system which we are considering, it is really so defined as to include that
energy which might be described as mutual to that system and external
bodies.

Let us imagine a great number of independent systems,
identical in nature, but differing in phase, that is, in their
condition with respect to configuration and velocity. The
forces are supposed to be determined for every system by the
same law, being functions of the coordinates of the system
$q_1, \dots, q_n$, either alone or with the coordinates $a_1$, $a_2$, etc. of
certain external bodies. It is not necessary that they should
be derivable from a force-function. The external coordinates
$a_1$, $a_2$, etc. may vary with the time, but at any given time
have fixed values. In this they differ from the internal
coordinates $q_1, \dots, q_n$, which at the same time have different
values in the different systems considered.

Let us especially consider the number of systems which at a
given instant fall within specified limits of phase, viz., those
for which

$$\begin{array}{ll}
p_1' < p_1 < p_1'', & q_1' < q_1 < q_1'', \\
p_2' < p_2 < p_2'', & q_2' < q_2 < q_2'', \\
\quad\vdots & \quad\vdots \\
p_n' < p_n < p_n'', & q_n' < q_n < q_n'',
\end{array} \tag{9}$$

the accented letters denoting constants. We shall suppose
the differences $p_1'' - p_1'$, $q_1'' - q_1'$, etc. to be infinitesimal, and
that the systems are distributed in phase in some continuous
manner,* so that the number having phases within the limits
specified may be represented by

$$D\,(p_1'' - p_1') \dots (p_n'' - p_n')\,(q_1'' - q_1') \dots (q_n'' - q_n'), \tag{10}$$

or more briefly by

$$D\,dp_1 \dots dp_n\,dq_1 \dots dq_n, \tag{11}$$

where $D$ is a function of the $p$'s and $q$'s and in general of $t$ also,
for as time goes on, and the individual systems change their
phases, the distribution of the ensemble in phase will in general
vary. In special cases, the distribution in phase will
remain unchanged. These are cases of statistical equilibrium.

* In strictness, a finite number of systems cannot be distributed continuously
in phase. But by increasing indefinitely the number of systems, we
may approximate to a continuous law of distribution, such as is here
described. To avoid tedious circumlocution, language like the above may
be allowed, although wanting in precision of expression, when the sense in
which it is to be taken appears sufficiently clear.

If we regard all possible phases as forming a sort of extension
of $2n$ dimensions, we may regard the product of differentials
in (11) as expressing an element of this extension, and
$D$ as expressing the density of the systems in that element.
We shall call the product

$$dp_1 \dots dp_n\,dq_1 \dots dq_n \tag{12}$$

an element of extension-in-phase, and $D$ the density-in-phase
of the systems.

It is evident that the changes which take place in the density
of the systems in any given element of extension-in-phase
will depend on the dynamical nature of the systems
and their distribution in phase at the time considered.

In the case of conservative systems, with which we shall be
principally concerned, their dynamical nature is completely
determined by the function which expresses the energy ($\epsilon$) in
terms of the $p$'s, $q$'s, and $a$'s (a function supposed identical
for all the systems); in the more general case which we are
considering, the dynamical nature of the systems is determined
by the functions which express the kinetic energy ($\epsilon_p$)
in terms of the $p$'s and $q$'s, and the forces in terms of the
$q$'s and $a$'s. The distribution in phase is expressed for the
time considered by $D$ as function of the $p$'s and $q$'s. To find
the value of $dD/dt$ for the specified element of extension-in-phase,
we observe that the number of systems within the
limits can only be varied by systems passing the limits, which
may take place in $4n$ different ways, viz., by the $p_1$ of a system
passing the limit $p_1'$, or the limit $p_1''$, or by the $q_1$ of a
system passing the limit $q_1'$, or the limit $q_1''$, etc. Let us
consider these cases separately.

In the first place, let us consider the number of systems
which in the time $dt$ pass into or out of the specified element
by passing the limit $p_1'$. It will be convenient, and it is
evidently allowable, to suppose $dt$ so small that the quantities
$\dot p_1\,dt$, $\dot q_1\,dt$, etc., which represent the increments of $p_1$, $q_1$, etc.,
in the time $dt$ shall be infinitely small in comparison with
the infinitesimal differences $p_1'' - p_1'$, $q_1'' - q_1'$, etc., which determine
the magnitude of the element of extension-in-phase.
The systems for which $p_1$ passes the limit $p_1'$ in the interval
$dt$ are those for which at the commencement of this interval
the value of $p_1$ lies between $p_1'$ and $p_1' - \dot p_1\,dt$, as is evident
if we consider separately the cases in which $\dot p_1$ is positive and
negative. Those systems for which $p_1$ lies between these
limits, and the other $p$'s and $q$'s between the limits specified in
(9), will therefore pass into or out of the element considered
according as $\dot p_1$ is positive or negative, unless indeed they also
pass some other limit specified in (9) during the same interval
of time. But the number which pass any two of these
limits will be represented by an expression containing the
square of $dt$ as a factor, and is evidently negligible, when $dt$
is sufficiently small, compared with the number which we are
seeking to evaluate, and which (with neglect of terms containing
$dt^2$) may be found by substituting $\dot p_1\,dt$ for $p_1'' - p_1'$ in
(10) or for $dp_1$ in (11).

The expression

$$D\,\dot p_1\,dt\,dp_2 \dots dp_n\,dq_1 \dots dq_n \tag{13}$$

will therefore represent, according as it is positive or negative,
the increase or decrease of the number of systems within the
given limits which is due to systems passing the limit $p_1'$. A
similar expression, in which however $D$ and $\dot p_1$ will have
slightly different values (being determined for $p_1''$ instead of
$p_1'$), will represent the decrease or increase of the number of
systems due to the passing of the limit $p_1''$. The difference
of the two expressions, or

$$\frac{d(D\dot p_1)}{dp_1}\,dp_1 \dots dp_n\,dq_1 \dots dq_n\,dt \tag{14}$$

will represent algebraically the decrease of the number of
systems within the limits due to systems passing the limits
$p_1'$ and $p_1''$.

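The decrease in the number of systems within the limits
due to systems passing the limits $q_1'$ and $q_1''$ may be found in
the same way. This will give

$$\left(\frac{d(D\dot p_1)}{dp_1} + \frac{d(D\dot q_1)}{dq_1}\right) dp_1 \dots dp_n\,dq_1 \dots dq_n\,dt \tag{15}$$

for the decrease due to passing the four limits $p_1'$, $p_1''$, $q_1'$, $q_1''$.
But since the equations of motion (3) give

$$\frac{d\dot p_1}{dp_1} + \frac{d\dot q_1}{dq_1} = 0, \tag{16}$$

the expression reduces to

$$\left(\frac{dD}{dp_1}\,\dot p_1 + \frac{dD}{dq_1}\,\dot q_1\right) dp_1 \dots dp_n\,dq_1 \dots dq_n\,dt. \tag{17}$$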
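
The step from the equations of motion to the vanishing of $d\dot p_1/dp_1 + d\dot q_1/dq_1$ can be spelled out as follows; this short working is an editorial addition and assumes only what the text has already stated, namely that $\dot q_1 = d\epsilon_p/dp_1$, that $\dot p_1 = -d\epsilon_p/dq_1 + F_1$, and that the force $F_1$ depends on the coordinates (and the time) but not on the momenta:

$$\frac{d\dot q_1}{dq_1} = \frac{d^2\epsilon_p}{dq_1\,dp_1}, \qquad \frac{d\dot p_1}{dp_1} = -\frac{d^2\epsilon_p}{dp_1\,dq_1} + \frac{dF_1}{dp_1} = -\frac{d^2\epsilon_p}{dp_1\,dq_1},$$

so that the two terms cancel. Expanding $d(D\dot p_1)/dp_1 = \dot p_1\,dD/dp_1 + D\,d\dot p_1/dp_1$, and likewise for the $q_1$ term, the parts multiplied by $D$ then drop out and only the terms in $dD/dp_1$ and $dD/dq_1$ remain.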


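If we prefix $\Sigma$ to denote summation relative to the suffixes
$1 \dots n$, we get the total decrease in the number of systems
within the limits in the time $dt$. That is,

$$\Sigma\left(\frac{dD}{dp_1}\,\dot p_1 + \frac{dD}{dq_1}\,\dot q_1\right) dp_1 \dots dp_n\,dq_1 \dots dq_n\,dt = -dD\,dp_1 \dots dp_n\,dq_1 \dots dq_n, \tag{18}$$

or

$$\left(\frac{dD}{dt}\right)_{p,q} = -\Sigma\left(\frac{dD}{dp_1}\,\dot p_1 + \frac{dD}{dq_1}\,\dot q_1\right), \tag{19}$$

where the suffix applied to the differential coefficient indicates
that the $p$'s and $q$'s are to be regarded as constant in the differentiation.
The condition of statistical equilibrium is therefore

$$\Sigma\left(\frac{dD}{dp_1}\,\dot p_1 + \frac{dD}{dq_1}\,\dot q_1\right) = 0. \tag{20}$$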

If at any instant this condition is fulfilled for all values of the
$p$'s and $q$'s, $(dD/dt)_{p,q}$ vanishes, and therefore the condition
will continue to hold, and the distribution in phase will be
permanent, so long as the external coordinates remain constant.
But the statistical equilibrium would in general be disturbed
by a change in the values of the external coordinates, which
would alter the values of the $\dot p$'s as determined by equations
(3), and thus disturb the relation expressed in the last equation.
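
A simple case in which the condition of statistical equilibrium is satisfied may help to fix ideas; it is an editorial illustration, not taken from this passage. Suppose the forces are conservative, so that $\dot q_1 = d\epsilon/dp_1$ and $\dot p_1 = -d\epsilon/dq_1$, and suppose the density-in-phase depends on the phase only through the energy, $D = f(\epsilon)$, where $f$ is an arbitrary differentiable function introduced here as an assumption. Then

$$\Sigma\left(\frac{dD}{dp_1}\,\dot p_1 + \frac{dD}{dq_1}\,\dot q_1\right) = f'(\epsilon)\,\Sigma\left(-\frac{d\epsilon}{dp_1}\,\frac{d\epsilon}{dq_1} + \frac{d\epsilon}{dq_1}\,\frac{d\epsilon}{dp_1}\right) = 0,$$

so that any such distribution is permanent while the external coordinates are held fixed.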
If we write equation (19) in the form

$$\left(\frac{dD}{dt}\right)_{p,q} dt + \Sigma\left(\frac{dD}{dp_1}\,\dot p_1\,dt + \frac{dD}{dq_1}\,\dot q_1\,dt\right) = 0, \tag{21}$$

it will be seen to express a theorem of remarkable simplicity.
Since $D$ is a function of $t$, $p_1, \dots, p_n$, $q_1, \dots, q_n$, its complete
differential will consist of parts due to the variations of all
these quantities. Now the first term of the equation represents
the increment of $D$ due to an increment of $t$ (with constant
values of the $p$'s and $q$'s), and the rest of the first member
represents the increments of $D$ due to increments of the $p$'s
and $q$'s, expressed by $\dot p_1\,dt$, $\dot q_1\,dt$, etc. But these are precisely
the increments which the $p$'s and $q$'s receive in the movement
of a system in the time $dt$. The whole expression represents
the total increment of $D$ for the varying phase of a moving
system. We have therefore the theorem:

In an ensemble of mechanical systems identical in nature and
subject to forces determined by identical laws, but distributed
in phase in any continuous manner, the density-in-phase is
constant in time for the varying phases of a moving system;
provided, that the forces of a system are functions of its coordinates,
either alone or with the time.*

This may be called the principle of conservation of density-in-phase.
It may also be written

$$\left(\frac{dD}{dt}\right)_{a, \dots, h} = 0, \tag{22}$$

where $a, \dots, h$ represent the arbitrary constants of the integral
equations of motion, and are suffixed to the differential coefficient
to indicate that they are to be regarded as constant
in the differentiation.

* The condition that the forces $F_1, \dots, F_n$ are functions of $q_1, \dots, q_n$ and
$a_1$, $a_2$, etc., which last are functions of the time, is analytically equivalent
to the condition that $F_1, \dots, F_n$ are functions of $q_1, \dots, q_n$ and the time.
Explicit mention of the external coordinates, $a_1$, $a_2$, etc., has been made in
the preceding pages, because our purpose will require us hereafter to consider
these coordinates and the connected forces, $A_1$, $A_2$, etc., which represent
the action of the systems on external bodies.

We may give to this principle a slightly different expression.
Let us call the value of the integral

$$\int \dots \int dp_1 \dots dp_n\,dq_1 \dots dq_n \tag{23}$$

taken within any limits the extension-in-phase within those
limits.

When the phases bounding an extension-in-phase vary in
the course of time according to the dynamical laws of a system
subject to forces which are functions of the coordinates either
alone or with the time, the value of the extension-in-phase thus
bounded remains constant. In this form the principle may be
called the principle of conservation of extension-in-phase. In
some respects this may be regarded as the most simple statement
of the principle, since it contains no explicit reference
to an ensemble of systems.
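
The principle can be tried numerically for one degree of freedom, where an extension-in-phase is simply an area in the $(q, p)$ plane. The sketch below is an editorial addition under stated assumptions: the system is a pendulum with energy $p^2/2 - \cos q$, the bounding phases start on a small circle, and the motion is approximated by the leapfrog scheme, which is chosen here because each of its sub-steps is itself area-preserving. The names `leapfrog`, `polygon_area` and `boundary` are hypothetical.

```python
import math


def leapfrog(p, q, dt, steps):
    """Approximate motion for the assumed energy p**2/2 - cos(q)."""
    for _ in range(steps):
        p -= 0.5 * dt * math.sin(q)  # half step of p_dot = -d(energy)/dq
        q += dt * p                  # full step of q_dot = d(energy)/dp
        p -= 0.5 * dt * math.sin(q)  # second half step of p_dot
    return p, q


def polygon_area(points):
    """Shoelace formula for the area enclosed by a list of (q, p) vertices."""
    total = 0.0
    n = len(points)
    for i in range(n):
        q0, p0 = points[i]
        q1, p1 = points[(i + 1) % n]
        total += q0 * p1 - q1 * p0
    return abs(total) / 2.0


def boundary(center_q, center_p, radius, n=1000):
    """Bounding phases: n points on a circle in the (q, p) plane."""
    return [
        (center_q + radius * math.cos(2.0 * math.pi * i / n),
         center_p + radius * math.sin(2.0 * math.pi * i / n))
        for i in range(n)
    ]


initial = boundary(1.0, 0.0, 0.3)

# Let every bounding phase move according to the same dynamical law.
evolved = []
for q0, p0 in initial:
    p_t, q_t = leapfrog(p0, q0, dt=0.01, steps=400)
    evolved.append((q_t, p_t))

area_before = polygon_area(initial)
area_after = polygon_area(evolved)
```

The region is sheared out of its circular shape by the motion, yet `area_after` should agree with `area_before` up to the small error of representing the boundary by a polygon.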

Since any extension-in-phase may be divided into infinitesimal
portions, it is only necessary to prove the principle for
an infinitely small extension. The number of systems of an
ensemble which fall within the extension will be represented
by the integral

$$\int \dots \int D\,dp_1 \dots dp_n\,dq_1 \dots dq_n.$$

If the extension is infinitely small, we may regard $D$ as constant
in the extension and write

$$D \int \dots \int dp_1 \dots dp_n\,dq_1 \dots dq_n$$

for the number of systems. The value of this expression must
be constant in time, since no systems are supposed to be
created or destroyed, and none can pass the limits, because
the motion of the limits is identical with that of the systems.
But we have seen that $D$ is constant in time, and therefore
the integral

$$\int \dots \int dp_1 \dots dp_n\,dq_1 \dots dq_n,$$

which we have called the extension-in-phase, is also constant
in time.*

* If we regard a phase as represented by a point in space of $2n$ dimensions,
the changes which take place in the course of time in our ensemble of
systems will be represented by a current in such space. This current will
be steady so long as the external coordinates are not varied. In any case
the current will satisfy a law which in its various expressions is analogous
to the hydrodynamic law which may be expressed by the phrases conservation
of volumes or conservation of density about a moving point, or by the equation

$$\frac{d\dot x}{dx} + \frac{d\dot y}{dy} + \frac{d\dot z}{dz} = 0.$$

The analogue in statistical mechanics of this equation, viz.,

$$\Sigma\left(\frac{d\dot p_1}{dp_1} + \frac{d\dot q_1}{dq_1}\right) = 0,$$

may be derived directly from equations (3) or (6), and may suggest such
theorems as have been enunciated, if indeed it is not regarded as making
them intuitively evident. The somewhat lengthy demonstrations given
above will at least serve to give precision to the notions involved, and
familiarity with their use.

Since the system of coordinates employed in the foregoing
discussion is entirely arbitrary, the values of the coordinates
relating to any configuration and its immediate vicinity do
not impose any restriction upon the values relating to other
configurations. The fact that the quantity which we have
called density-in-phase is constant in time for any given system,
implies therefore that its value is independent of the
coordinates which are used in its evaluation. For let the
density-in-phase as evaluated for the same time and phase by
one system of coordinates be $D_1'$, and by another system $D_2'$.
A system which at that time has that phase will at another
time have another phase. Let the density as calculated for
this second time and phase by a third system of coordinates
be $D_3''$. Now we may imagine a system of coordinates which
at and near the first configuration will coincide with the first
system of coordinates, and at and near the second configuration
will coincide with the third system of coordinates. This will
give $D_1' = D_3''$. Again we may imagine a system of coordinates
which at and near the first configuration will coincide
with the second system of coordinates, and at and near the
second configuration will coincide with the third system of
coordinates. This will give $D_2' = D_3''$. We have therefore
$D_1' = D_2'$.

It follows, or it may be proved in the same way, that the
value of an extension-in-phase is independent of the system
of coordinates which is used in its evaluation. This may
easily be verified directly. If $q_1, \dots, q_n$, $Q_1, \dots, Q_n$ are two
systems of coordinates, and $p_1, \dots, p_n$, $P_1, \dots, P_n$ the corresponding
momenta, we have to prove that

$$\int \dots \int dp_1 \dots dp_n\,dq_1 \dots dq_n = \int \dots \int dP_1 \dots dP_n\,dQ_1 \dots dQ_n, \tag{24}$$

when the multiple integrals are taken within limits consisting
of the same phases. And this will be evident from the principle
on which we change the variables in a multiple integral,
if we prove that

$$\frac{d(P_1, \dots, P_n, Q_1, \dots, Q_n)}{d(p_1, \dots, p_n, q_1, \dots, q_n)} = 1, \tag{25}$$

where the first member of the equation represents a Jacobian
or functional determinant. Since all its elements of the form
$dQ/dp$ are equal to zero, the determinant reduces to a product
of two, and we have to prove that

$$\frac{d(P_1, \dots, P_n)}{d(p_1, \dots, p_n)}\,\frac{d(Q_1, \dots, Q_n)}{d(q_1, \dots, q_n)} = 1. \tag{26}$$

We may transform any element of the first of these determinants
as follows. By equations (2) and (3), and in
view of the fact that the $\dot Q$'s are linear functions of the $\dot q$'s
and therefore of the $p$'s, with coefficients involving the $q$'s,
so that a differential coefficient of the form $d\dot Q_r/dp_y$ is function
of the $q$'s alone, we get *

$$\frac{dP_x}{dp_y} = \frac{d}{dp_y}\frac{d\epsilon_p}{d\dot Q_x} = \sum_{r=1}^{r=n}\left(\frac{d^2\epsilon_p}{d\dot Q_r\,d\dot Q_x}\,\frac{d\dot Q_r}{dp_y}\right) = \frac{d}{d\dot Q_x}\sum_{r=1}^{r=n}\left(\frac{d\epsilon_p}{d\dot Q_r}\,\frac{d\dot Q_r}{dp_y}\right) = \frac{d}{d\dot Q_x}\frac{d\epsilon_p}{dp_y} = \frac{d\dot q_y}{d\dot Q_x}. \tag{27}$$

But since

$$\dot q_y = \sum_{r=1}^{r=n}\frac{dq_y}{dQ_r}\,\dot Q_r,$$

$$\frac{d\dot q_y}{d\dot Q_x} = \frac{dq_y}{dQ_x}. \tag{28}$$

Therefore,

$$\frac{d(P_1, \dots, P_n)}{d(p_1, \dots, p_n)} = \frac{d(\dot q_1, \dots, \dot q_n)}{d(\dot Q_1, \dots, \dot Q_n)} = \frac{d(q_1, \dots, q_n)}{d(Q_1, \dots, Q_n)}. \tag{29}$$

The equation to be proved is thus reduced to

$$\frac{d(q_1, \dots, q_n)}{d(Q_1, \dots, Q_n)}\,\frac{d(Q_1, \dots, Q_n)}{d(q_1, \dots, q_n)} = 1, \tag{30}$$

which is easily proved by the ordinary rule for the multiplication
of determinants.

* The form of the equation

$$\frac{d}{dp_y}\frac{d\epsilon_p}{d\dot Q_x} = \frac{d}{d\dot Q_x}\frac{d\epsilon_p}{dp_y}$$

in (27) reminds us of the fundamental identity in the differential calculus
relating to the order of differentiation with respect to independent variables.
But it will be observed that here the variables $\dot Q_x$ and $p_y$ are not independent
and that the proof depends on the linear relation between the $\dot Q$'s and the $p$'s.
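
The independence of the extension-in-phase from the choice of coordinates can be checked numerically in a familiar case. The sketch below is an editorial addition: it assumes a particle in a plane, takes Cartesian coordinates $x$, $y$ with momenta $p_x$, $p_y$ as one system and polar coordinates $r$, $\theta$ with their conjugate momenta as the other, and estimates the functional determinant of the new phase variables with respect to the old by central differences. The names `to_polar_phase`, `jacobian` and `determinant`, and the sample phase point, are hypothetical.

```python
import math


def to_polar_phase(px, py, x, y):
    """Map (p_x, p_y, x, y) to (P_r, P_theta, r, theta) for a planar particle."""
    r = math.hypot(x, y)
    theta = math.atan2(y, x)
    p_r = (x * px + y * py) / r   # momentum conjugate to r
    p_theta = x * py - y * px     # momentum conjugate to theta
    return [p_r, p_theta, r, theta]


def jacobian(func, point, h=1e-6):
    """Central-difference matrix of derivatives d(func_i)/d(point_j)."""
    n = len(point)
    columns = []
    for j in range(n):
        plus = list(point)
        minus = list(point)
        plus[j] += h
        minus[j] -= h
        f_plus = func(*plus)
        f_minus = func(*minus)
        columns.append([(a - b) / (2.0 * h) for a, b in zip(f_plus, f_minus)])
    # columns[j][i] holds d(func_i)/d(point_j); transpose into rows.
    return [[columns[j][i] for j in range(n)] for i in range(n)]


def determinant(matrix):
    """Determinant by Gaussian elimination with partial pivoting."""
    a = [row[:] for row in matrix]
    n = len(a)
    det = 1.0
    for c in range(n):
        pivot = max(range(c, n), key=lambda row: abs(a[row][c]))
        if a[pivot][c] == 0.0:
            return 0.0
        if pivot != c:
            a[c], a[pivot] = a[pivot], a[c]
            det = -det
        det *= a[c][c]
        for row in range(c + 1, n):
            factor = a[row][c] / a[c][c]
            for k in range(c, n):
                a[row][k] -= factor * a[c][k]
    return det


phase_point = [0.4, -1.1, 1.5, 0.7]  # arbitrary (p_x, p_y, x, y)
det_phase = determinant(jacobian(to_polar_phase, phase_point))
```

In this case the determinant of the momenta alone equals $r$ and that of the coordinates alone equals $1/r$, so their product, and hence `det_phase`, should be unity to within the finite-difference error.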

The numerical value of an extension-in-phase will however
depend on the units in which we measure energy and time.
For a product of the form $dp\,dq$ has the dimensions of energy
multiplied by time, as appears from equation (2), by which
the momenta are defined. Hence an extension-in-phase has
the dimensions of the $n$th power of the product of energy
and time. In other words, it has the dimensions of the $n$th
power of action, as the term is used in the ‘principle of Least
Action.’

If we distinguish by accents the values of the momenta
and coordinates which belong to a time $t'$, the unaccented
letters relating to the time $t$, the principle of the conservation
of extension-in-phase may be written

$$\int \dots \int dp_1 \dots dp_n\,dq_1 \dots dq_n = \int \dots \int dp_1' \dots dp_n'\,dq_1' \dots dq_n', \tag{31}$$

or more briefly

$$\int \dots \int dp_1 \dots dq_n = \int \dots \int dp_1' \dots dq_n', \tag{32}$$

the limiting phases being those which belong to the same
systems at the times $t$ and $t'$ respectively. But we have
identically

$$\int \dots \int dp_1 \dots dq_n = \int \dots \int \frac{d(p_1, \dots, q_n)}{d(p_1', \dots, q_n')}\,dp_1' \dots dq_n'$$

for such limits. The principle of conservation of extension-in-phase
may therefore be expressed in the form

$$\frac{d(p_1, \dots, q_n)}{d(p_1', \dots, q_n')} = 1. \tag{33}$$

This equation is easily proved directly. For we have
identically

$$\frac{d(p_1, \dots, q_n)}{d(p_1', \dots, q_n')} = \frac{d(p_1, \dots, q_n)}{d(p_1'', \dots, q_n'')}\,\frac{d(p_1'', \dots, q_n'')}{d(p_1', \dots, q_n')},$$

where the double accents distinguish the values of the momenta
and coordinates for a time $t''$. If we vary $t$, while $t'$ and $t''$
remain constant, we have

$$\frac{d}{dt}\frac{d(p_1, \dots, q_n)}{d(p_1', \dots, q_n')} = \frac{d(p_1'', \dots, q_n'')}{d(p_1', \dots, q_n')}\,\frac{d}{dt}\frac{d(p_1, \dots, q_n)}{d(p_1'', \dots, q_n'')}. \tag{34}$$

Now since the time $t''$ is entirely arbitrary, nothing prevents
us from making $t''$ identical with $t$ at the moment considered.
Then the determinant

$$\frac{d(p_1, \dots, q_n)}{d(p_1'', \dots, q_n'')}$$

will have unity for each of the elements on the principal
diagonal, and zero for all the other elements. Since every
term of the determinant except the product of the elements
on the principal diagonal will have two zero factors, the differential
of the determinant will reduce to that of the product of
these elements, i. e., to the sum of the differentials of these
elements. This gives the equation

$$\frac{d}{dt}\frac{d(p_1, \dots, q_n)}{d(p_1'', \dots, q_n'')} = \frac{d\dot p_1}{dp_1''} + \dots + \frac{d\dot p_n}{dp_n''} + \frac{d\dot q_1}{dq_1''} + \dots + \frac{d\dot q_n}{dq_n''}.$$

Now since $t = t''$, the double accents in the second member
of this equation may evidently be neglected. This will give,
in virtue of such relations as (16),

$$\frac{d}{dt}\frac{d(p_1, \dots, q_n)}{d(p_1'', \dots, q_n'')} = 0,$$

which substituted in (34) will give

$$\frac{d}{dt}\frac{d(p_1, \dots, q_n)}{d(p_1', \dots, q_n')} = 0.$$

The determinant in this equation is therefore a constant, the
value of which may be determined at the instant when $t = t'$,
when it is evidently unity. Equation (33) is therefore
demonstrated.
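
The result just demonstrated, that the functional determinant of the phase at one time with respect to the phase at another is unity, can also be examined numerically. The sketch below is an editorial addition, assuming a pendulum with energy $p^2/2 - \cos q$ and a fourth-order Runge-Kutta approximation to the motion; the determinant is estimated by central differences in the initial phase. The names `rhs`, `flow` and `flow_jacobian_det` and the sample values are hypothetical.

```python
import math


def rhs(p, q):
    """Canonical equations for the assumed energy: returns (p_dot, q_dot)."""
    return -math.sin(q), p


def flow(p, q, t_total, dt=1e-3):
    """Approximate the phase at time t from the phase (p, q) at time t'."""
    for _ in range(int(round(t_total / dt))):
        k1p, k1q = rhs(p, q)
        k2p, k2q = rhs(p + 0.5 * dt * k1p, q + 0.5 * dt * k1q)
        k3p, k3q = rhs(p + 0.5 * dt * k2p, q + 0.5 * dt * k2q)
        k4p, k4q = rhs(p + dt * k3p, q + dt * k3q)
        p_next = p + dt * (k1p + 2 * k2p + 2 * k3p + k4p) / 6.0
        q_next = q + dt * (k1q + 2 * k2q + 2 * k3q + k4q) / 6.0
        p, q = p_next, q_next
    return p, q


def flow_jacobian_det(p0, q0, t_total, h=1e-5):
    """Central-difference estimate of d(p, q)/d(p', q') for the flow."""
    p_a, q_a = flow(p0 + h, q0, t_total)
    p_b, q_b = flow(p0 - h, q0, t_total)
    p_c, q_c = flow(p0, q0 + h, t_total)
    p_d, q_d = flow(p0, q0 - h, t_total)
    dp_dp0 = (p_a - p_b) / (2.0 * h)
    dq_dp0 = (q_a - q_b) / (2.0 * h)
    dp_dq0 = (p_c - p_d) / (2.0 * h)
    dq_dq0 = (q_c - q_d) / (2.0 * h)
    return dp_dp0 * dq_dq0 - dp_dq0 * dq_dp0


det_flow = flow_jacobian_det(0.5, 1.0, 3.0)
```

Because the motion is not linear, the individual elements of the determinant differ appreciably from those of the unit matrix after a finite time, but `det_flow` itself should remain equal to unity within the numerical error.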

Again, if we write a,... A for a system of 2 w arbitrary con¬ 
stants of the integral equations of motion, etc. will be 

functions of a, . . . 7t, and t, and we may express an extension- 
in-phase in the form 



If we suppose the limits specified by values of a, ... A, a 
system initially at the limits will remain at the limits. 
The principle of conservation of extension-in-phase requires 
that an extension thus bounded shall have a constant value. 
This requires that the determinant under the integral sign 
shall be constant, which may be written 


dt d{a, ... A) 


(36) 


This equation, which may be regarded as expressing the prin¬ 
ciple of conservation of extension-in-phase, may be derived 
directly from the identity 


dj piy _ d(yi, ...g,) d( j?/, .. . 

dioiy ... A) d{pt\ ... qj) d{ay ... A) 


in connection with equation (33). 

Since the coordinates and momenta are functions of a, ... A, 
and ty tlie determinant in (36) must be a function of the same 
variables, and since it does not vary with the time, it must 
be a function of a, ... A alone. We have therefore 


d{pu ---gn) 
d^cL ^... A) 


func. (a, ... A). 


(37) 
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It is the relative numbers of systems which fall within dif¬ 
ferent limits, rather than the absolute numbers, with which we 
are most concerned. It is indeed only with regard to relative 
numbers that such discussions as the preceding will apply 
with literal precision, since the nature of our reasoning implies 
that the number of systems in the smallest element of space 
which we consider is very great. This is evidently inconsist¬ 
ent vrith a finite value of the total number of systems, or of 
the density-in-phase. Now if the value of D is infinite, we 
cannot speak of any definite number of systems within any 
finite limits, since all such numbers are infinite. But the 
ratios of these infinite numbers may be perfectly definite. If 
we write N for the total number of systems, and set 

^ = (38) 

P may remain finite, when N and D become infinite. The 

integral 

dp,... dq„ (39) 

taken within any given limits, will evidently express the ratio 
of the number of systems falling within those limits to the 
whole number of systems. This is the same thing as the 
probability that an unspecified system of the ensemble (i. e. 
one of which we only know that it belongs to the ensemble) 
will lie within the given limits. The product 

\[ P \, dp_1 \ldots dq_n \tag{40} \]

expresses the probability that an unspecified system of the
ensemble will be found in the element of extension-in-phase
$dp_1 \ldots dq_n$. We shall call P the coefficient of probability
of the phase considered. Its natural logarithm we shall call the
index of probability of the phase, and denote it by the letter $\eta$.
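If we substitute NP and $Ne^{\eta}$ for D in equation (19), we get

\[ \left(\frac{dP}{dt}\right)_{p,q} = -\sum \left( \frac{dP}{dp_1}\,\dot p_1 + \frac{dP}{dq_1}\,\dot q_1 \right), \tag{41} \]

and

\[ \left(\frac{d\eta}{dt}\right)_{p,q} = -\sum \left( \frac{d\eta}{dp_1}\,\dot p_1 + \frac{d\eta}{dq_1}\,\dot q_1 \right). \tag{42} \]

The condition of statistical equilibrium may be expressed
by equating to zero the second member of either of these
equations.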

The same substitutions in (22) give

\[ \left(\frac{dP}{dt}\right)_{a, \ldots h} = 0, \tag{43} \]

and

\[ \left(\frac{d\eta}{dt}\right)_{a, \ldots h} = 0. \tag{44} \]

That is, the values of P and $\eta$, like those of D, are constant
in time for moving systems of the ensemble. From this point
of view, the principle which otherwise regarded has been
called the principle of conservation of density-in-phase or
conservation of extension-in-phase, may be called the
principle of conservation of the coefficient (or index) of
probability of a phase varying according to dynamical laws, or
more briefly, the principle of conservation of probability of
phase. It is subject to the limitation that the forces must be
functions of the coordinates of the system either alone or with
the time.

The application of this principle is not limited to cases in
which there is a formal and explicit reference to an ensemble of
systems. Yet the conception of such an ensemble may serve
to give precision to notions of probability. It is in fact
customary in the discussion of probabilities to describe anything
which is imperfectly known as something taken at random
from a great number of things which are completely described.
But if we prefer to avoid any reference to an ensemble
of systems, we may observe that the probability that the
phase of a system falls within certain limits at a certain time,
is equal to the probability that at some other time the phase
will fall within the limits formed by phases corresponding to
the first. For either occurrence necessitates the other. That
is, if we write P' for the coefficient of probability of the
phase $p_1', \ldots q_n'$ at the time t', and P'' for that of the phase
$p_1'', \ldots q_n''$ at the time t'',

\[ \int \cdots \int P' \, dp_1' \ldots dq_n' = \int \cdots \int P'' \, dp_1'' \ldots dq_n'', \tag{45} \]

where the limits in the two cases are formed by corresponding
phases. When the integrations cover infinitely small
variations of the momenta and coordinates, we may regard P' and
P'' as constant in the integrations and write

\[ P' \int \cdots \int dp_1' \ldots dq_n' = P'' \int \cdots \int dp_1'' \ldots dq_n''. \]

Now the principle of the conservation of extension-in-phase,
which has been proved (viz., in the second demonstration given
above) independently of any reference to an ensemble of
systems, requires that the values of the multiple integrals in
this equation shall be equal. This gives

\[ P'' = P'. \]

With reference to an important class of cases this principle 
may be enunciated as follows. 

When the differential equations of motion are exactly known,
but the constants of the integral equations imperfectly
determined, the coefficient of probability of any phase at any time is
equal to the coefficient of probability of the corresponding phase
at any other time. By corresponding phases are meant those 
which are calculated for different times from the same values 
of the arbitrary constants of the integral equations. 
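
The equality of the coefficients of probability of corresponding phases rests on the fact that the map carrying phases at one time to the corresponding phases at another has a Jacobian determinant of unity. As a rough numerical illustration, which is not part of the original argument, the sketch below follows a pendulum with $H = p^2/2 - \cos q$ (an assumed example system) by the leapfrog scheme, which is itself an area-preserving map, and estimates the Jacobian of the resulting map by central differences. The names `leapfrog` and `flow_jacobian_det` are hypothetical.

```python
import math

def leapfrog(q, p, dt, steps):
    """Advance a unit-mass pendulum, H = p**2/2 - cos(q), by `steps` leapfrog steps."""
    for _ in range(steps):
        p -= 0.5 * dt * math.sin(q)   # half kick: dp/dt = -dH/dq = -sin(q)
        q += dt * p                   # drift:     dq/dt =  dH/dp = p
        p -= 0.5 * dt * math.sin(q)   # second half kick
    return q, p

def flow_jacobian_det(q0, p0, dt=0.01, steps=500, h=1e-5):
    """Estimate d(q, p)/d(q0, p0) for the map (q0, p0) -> (q, p) by central differences."""
    qa, pa = leapfrog(q0 + h, p0, dt, steps)
    qb, pb = leapfrog(q0 - h, p0, dt, steps)
    qc, pc = leapfrog(q0, p0 + h, dt, steps)
    qd, pd = leapfrog(q0, p0 - h, dt, steps)
    dq_dq0, dp_dq0 = (qa - qb) / (2 * h), (pa - pb) / (2 * h)
    dq_dp0, dp_dp0 = (qc - qd) / (2 * h), (pc - pd) / (2 * h)
    return dq_dq0 * dp_dp0 - dq_dp0 * dp_dq0

if __name__ == "__main__":
    # The determinant should differ from unity only by finite-difference error.
    print(flow_jacobian_det(1.0, 0.5))
```

Since an element of extension-in-phase keeps its magnitude under such a map, the probability it carries can be conserved only if the coefficient of probability is the same at corresponding phases.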

Since the sum of the probabilities of all possible cases is
necessarily unity, it is evident that we must have

\[ \int_{\text{phases}}^{\text{all}} \cdots \int P \, dp_1 \ldots dq_n = 1, \tag{46} \]

where the integration extends over all phases. This is indeed
only a different form of the equation

\[ N = \int_{\text{phases}}^{\text{all}} \cdots \int D \, dp_1 \ldots dq_n, \]

which we may regard as defining N.

The values of the coefficient and index of probability of
phase, like that of the density-in-phase, are independent of the
system of coordinates which is employed to express the
distribution in phase of a given ensemble.

In dimensions, the coefficient of probability is the reciprocal
of an extension-in-phase, that is, the reciprocal of the nth
power of the product of time and energy. The index of
probability is therefore affected by an additive constant when we
change our units of time and energy. If the unit of time is
multiplied by $c_t$ and the unit of energy is multiplied by
$c_\epsilon$, all indices of probability relating to systems of n
degrees of freedom will be increased by the addition of

\[ n \log c_t + n \log c_\epsilon. \tag{47} \]
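
The effect of a change of units on the index of probability may be checked on a simple case. The following sketch assumes, purely for illustration, an ensemble spread uniformly over a region of given extension-in-phase, so that the coefficient of probability is the reciprocal of that extension; the function names are hypothetical.

```python
import math

def index_shift(n, c_t, c_e):
    """Expression (47): the additive change of the index of probability."""
    return n * math.log(c_t) + n * math.log(c_e)

def uniform_index(extension):
    """Index of probability of an ensemble spread uniformly over a region
    whose extension-in-phase has the given numerical value (an assumed case)."""
    return -math.log(extension)

def observed_shift(n, extension_old, c_t, c_e):
    """Change of the index when the units of time and energy are enlarged."""
    # Enlarging the units by c_t and c_e divides the numerical value of an
    # extension-in-phase, of dimensions (energy * time)**n, by (c_t * c_e)**n.
    extension_new = extension_old / (c_t * c_e) ** n
    return uniform_index(extension_new) - uniform_index(extension_old)

if __name__ == "__main__":
    # The two printed numbers should agree.
    print(observed_shift(3, 2.0, 60.0, 1000.0), index_shift(3, 60.0, 1000.0))
```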



CHAPTER II.


APPLICATION OF THE PRINCIPLE OF CONSERVATION
OF EXTENSION-IN-PHASE TO THE THEORY
OF ERRORS.

Let us now proceed to combine the principle which has been
demonstrated in the preceding chapter and which in its different
applications and regarded from different points of view
has been variously designated as the conservation of
density-in-phase, or of extension-in-phase, or of probability of phase,
with those approximate relations which are generally used in
the ‘theory of errors.’

We suppose that the differential equations of the motion of
a system are exactly known, but that the constants of the
integral equations are only approximately determined. It is
evident that the probability that the momenta and coordinates
at the time t' fall between the limits $p_1'$ and $p_1' + dp_1'$,
$q_1'$ and $q_1' + dq_1'$, etc., may be expressed by the formula

\[ e^{\eta'} \, dp_1' \ldots dq_n', \tag{48} \]

where $\eta'$ (the index of probability for the phase in question) is
a function of the coordinates and momenta and of the time.

Let $P_1'$, $Q_1'$, etc. be the values of the coordinates and momenta
which give the maximum value to $\eta'$, and let the general
value of $\eta'$ be developed by Taylor’s theorem according to
ascending powers and products of the differences $p_1' - P_1'$,
$q_1' - Q_1'$, etc., and let us suppose that we have a sufficient
approximation without going beyond terms of the second
degree in these differences. We may therefore set

\[ \eta' = c - F', \tag{49} \]

where c is independent of the differences $p_1' - P_1'$, $q_1' - Q_1'$,
etc., and F' is a homogeneous quadratic function of these
differences. The terms of the first degree vanish in virtue
of the maximum condition, which also requires that F' must
have a positive value except when all the differences
mentioned vanish. If we set

\[ C = e^{c}, \tag{50} \]

we may write for the probability that the phase lies within
the limits considered

\[ C e^{-F'} \, dp_1' \ldots dq_n'. \tag{51} \]

C is evidently the maximum value of the coefficient of
probability at the time considered.

In regard to the degree of approximation represented by
these formulae, it is to be observed that we suppose, as is
usual in the ‘theory of errors,’ that the determination
(explicit or implicit) of the constants of motion is of such
precision that the coefficient of probability $e^{\eta'}$ or
$C e^{-F'}$ is practically zero except for very small values of
the differences $p_1' - P_1'$, $q_1' - Q_1'$, etc. For very small
values of these differences the approximation is evidently in
general sufficient, for larger values of these differences the
value of $C e^{-F'}$ will be sensibly zero, as it should be, and
in this sense the formula will represent the facts.

We shall suppose that the forces to which the system is
subject are functions of the coordinates either alone or with
the time. The principle of conservation of probability of
phase will therefore apply, which requires that at any other
time (t'') the maximum value of the coefficient of probability
shall be the same as at the time t', and that the phase
($P_1''$, $Q_1''$, etc.) which has this greatest probability-coefficient,
shall be that which corresponds to the phase ($P_1'$, $Q_1'$, etc.),
i. e., which is calculated from the same values of the constants
of the integral equations of motion.

We may therefore write for the probability that the phase
at the time t'' falls within the limits $p_1''$ and $p_1'' + dp_1''$,
$q_1''$ and $q_1'' + dq_1''$, etc.,

\[ C e^{-F''} \, dp_1'' \ldots dq_n'', \tag{52} \]

where C represents the same value as in the preceding
formula, viz., the constant value of the maximum coefficient
of probability, and F'' is a quadratic function of the differences
$p_1'' - P_1''$, $q_1'' - Q_1''$, etc., the phase ($P_1''$, $Q_1''$, etc.)
being that which at the time t'' corresponds to the phase
($P_1'$, $Q_1'$, etc.) at the time t'.

Now we have necessarily

\[ \int \cdots \int C e^{-F'} \, dp_1' \ldots dq_n' = \int \cdots \int C e^{-F''} \, dp_1'' \ldots dq_n'' = 1, \tag{53} \]

when the integration is extended over all possible phases.
It will be allowable to set $\pm\infty$ for the limits of all the
coordinates and momenta, not because these values represent the
actual limits of possible phases, but because the portions of
the integrals lying outside of the limits of all possible phases
will have sensibly the value zero. With $\pm\infty$ for limits, the
equation gives

\[ \frac{C \pi^n}{\sqrt{f'}} = \frac{C \pi^n}{\sqrt{f''}} = 1, \tag{54} \]

where f' is the discriminant * of F', and f'' that of F''. This
discriminant is therefore constant in time, and like C an
absolute invariant in respect to the system of coordinates which
may be employed. In dimensions, like $C^2$, it is the reciprocal
of the 2nth power of the product of energy and time.
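
The relation between the constant C and the discriminant may be verified numerically in the simplest case of one degree of freedom, where the integral of $e^{-F}$ over the plane should equal $\pi$ divided by the square root of the discriminant. The sketch below is only an illustration under assumed values: the coefficients of the quadratic function are arbitrary, and the function names are hypothetical.

```python
import math

def gaussian_integral(a, b, c, half_width=8.0, cells=400):
    """Midpoint-rule estimate of the integral of exp(-F) over the (p, q) plane,
    with F = a*p**2 + b*p*q + c*q**2 assumed positive definite."""
    step = 2.0 * half_width / cells
    total = 0.0
    for i in range(cells):
        p = -half_width + (i + 0.5) * step
        for j in range(cells):
            q = -half_width + (j + 0.5) * step
            total += math.exp(-(a * p * p + b * p * q + c * q * q))
    return total * step * step

def discriminant(a, b, c):
    """Determinant with the coefficients of the squares on the diagonal and
    half the coefficient of the product elsewhere."""
    return a * c - (b / 2.0) ** 2

if __name__ == "__main__":
    a, b, c = 2.0, 1.0, 1.0                 # assumed example coefficients
    f = discriminant(a, b, c)
    # The two printed numbers should agree closely.
    print(gaussian_integral(a, b, c), math.pi / math.sqrt(f))
```

With n = 1 the value of C which satisfies the normalization is then $\sqrt{f}/\pi$.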

* This term is used to denote the determinant having for elements on the
principal diagonal the coefficients of the squares in the quadratic function
F', and for its other elements the halves of the coefficients of the products
in F'.

Let us see precisely how the functions F' and F'' are related.
The principle of the conservation of the probability-coefficient
requires that any values of the coordinates and momenta at the
time t' shall give the function F' the same value as the
corresponding coordinates and momenta at the time t'' give to F''.
Therefore F'' may be derived from F' by substituting for
$p_1', \ldots q_n'$ their values in terms of $p_1'', \ldots q_n''$. Now we
have approximately

\[
\begin{aligned}
p_1' - P_1' &= \frac{dP_1'}{dP_1''}\,(p_1'' - P_1'') + \cdots + \frac{dP_1'}{dQ_n''}\,(q_n'' - Q_n''), \\
&\;\;\vdots \\
q_n' - Q_n' &= \frac{dQ_n'}{dP_1''}\,(p_1'' - P_1'') + \cdots + \frac{dQ_n'}{dQ_n''}\,(q_n'' - Q_n''),
\end{aligned}
\tag{55}
\]


and as in F'' terms of higher degree than the second are to be
neglected, these equations may be considered accurate for the
purpose of the transformation required. Since by equation
(33) the eliminant of these equations has the value unity,
the discriminant of F'' will be equal to that of F', as has
already appeared from the consideration of the principle of
conservation of probability of phase, which is, in fact,
essentially the same as that expressed by equation (33).
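
The invariance of the discriminant under a linear substitution of unit determinant can be seen in a small computation. The sketch below assumes, for illustration only, a harmonic oscillator with $H = p^2/2m + m\omega^2 q^2/2$, for which the relation between the deviations at two times is exactly linear; the helper names and the numerical values are hypothetical.

```python
import math

def mat_mul(x, y):
    """Product of two 2x2 matrices stored as nested lists."""
    return [[sum(x[i][k] * y[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]

def transpose(x):
    return [[x[j][i] for j in range(2)] for i in range(2)]

def det(x):
    return x[0][0] * x[1][1] - x[0][1] * x[1][0]

def back_map(mass, omega, elapsed):
    """Matrix M with (dq', dp') = M (dq'', dp''): the oscillator motion traced
    backwards over the interval `elapsed` between the two times."""
    c, s = math.cos(omega * elapsed), math.sin(omega * elapsed)
    return [[c, -s / (mass * omega)],
            [mass * omega * s, c]]

def transformed_form(a_prime, m):
    """Coefficient matrix of F'' obtained by substituting x' = M x'' into F'."""
    return mat_mul(transpose(m), mat_mul(a_prime, m))

if __name__ == "__main__":
    a_prime = [[2.0, 0.5], [0.5, 1.0]]      # assumed coefficients of F'
    m = back_map(mass=2.0, omega=3.0, elapsed=0.7)
    a_second = transformed_form(a_prime, m)
    # The eliminant is unity, so the two discriminants should coincide.
    print(det(m), det(a_prime), det(a_second))
```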

At the time t', the phases satisfying the equation

\[ F' = k, \tag{56} \]

where k is any positive constant, have the probability-coefficient
$C e^{-k}$. At the time t'', the corresponding phases satisfy
the equation

\[ F'' = k, \tag{57} \]

and have the same probability-coefficient. So also the phases
within the limits given by one or the other of these equations
are corresponding phases, and have probability-coefficients
greater than $C e^{-k}$, while phases without these limits have less
probability-coefficients. The probability that the phase at
the time t' falls within the limits F' = k is the same as the
probability that it falls within the limits F'' = k at the time
t'', since either event necessitates the other. This probability
may be evaluated as follows. We may omit the accents, as
we need only consider a single time. Let us denote the
extension-in-phase within the limits F = k by U, and the
probability that the phase falls within these limits by R, also the
extension-in-phase within the limits F = 1 by $U_1$. We have
then by definition

\[ U = \int^{F=k} \cdots \int dp_1 \ldots dq_n, \tag{58} \]

\[ R = \int^{F=k} \cdots \int C e^{-F} \, dp_1 \ldots dq_n, \tag{59} \]

\[ U_1 = \int^{F=1} \cdots \int dp_1 \ldots dq_n. \tag{60} \]

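But since F is a homogeneous quadratic function of the
differences

\[ p_1 - P_1, \; p_2 - P_2, \; \ldots \; q_n - Q_n, \]

we have identically

\[
\begin{aligned}
\int^{F=k} \cdots \int d(p_1 - P_1) \ldots d(q_n - Q_n)
&= \int^{kF=k} \cdots \int k^n \, d(p_1 - P_1) \ldots d(q_n - Q_n) \\
&= k^n \int^{F=1} \cdots \int d(p_1 - P_1) \ldots d(q_n - Q_n).
\end{aligned}
\]

That is

\[ U = k^n \, U_1, \tag{61} \]

whence

\[ dU = U_1 \, n \, k^{n-1} \, dk. \tag{62} \]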

But if k varies, equations (58) and (59) give

\[ dU = \int_{F=k}^{F=k+dk} \cdots \int dp_1 \ldots dq_n, \tag{63} \]

\[ dR = \int_{F=k}^{F=k+dk} \cdots \int C e^{-F} \, dp_1 \ldots dq_n. \tag{64} \]

Since the factor $C e^{-F}$ has the constant value $C e^{-k}$ in the
last multiple integral, we have

\[ dR = C e^{-k} \, dU = C \, U_1 \, n \, e^{-k} k^{n-1} \, dk, \tag{65} \]

\[ R = -C \, U_1 \, n! \; e^{-k} \left( 1 + k + \frac{k^2}{2} + \cdots + \frac{k^{n-1}}{(n-1)!} \right) + \text{const.} \tag{66} \]

We may determine the constant of integration by the condition
that R vanishes with k. This gives

\[ R = C \, U_1 \, n! - C \, U_1 \, n! \; e^{-k} \left( 1 + k + \frac{k^2}{2} + \cdots + \frac{k^{n-1}}{(n-1)!} \right). \tag{67} \]

We may determine the value of the constant $U_1$ by the
condition that R = 1 for $k = \infty$. This gives $C \, U_1 \, n! = 1$, and

\[ R = 1 - e^{-k} \left( 1 + k + \frac{k^2}{2} + \cdots + \frac{k^{n-1}}{(n-1)!} \right). \tag{68} \]

It is worthy of notice that the form of these equations
depends only on the number of degrees of freedom of the system,
being in other respects independent of its dynamical nature,
except that the forces must be functions of the coordinates
either alone or with the time.
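
Since the expression for the probability R involves only k and the number of degrees of freedom n, it is easily tabulated. The following sketch, with hypothetical function names, evaluates the closed form and compares its numerical derivative with the differential relation $dR = e^{-k} k^{n-1} dk / (n-1)!$, which follows from the formula for dR once $C \, U_1 \, n! = 1$.

```python
import math

def probability_within(k, n):
    """Probability R that F <= k, for n degrees of freedom."""
    partial = sum(k ** j / math.factorial(j) for j in range(n))
    return 1.0 - math.exp(-k) * partial

def density_in_k(k, n):
    """dR/dk, namely exp(-k) * k**(n-1) / (n-1)!."""
    return math.exp(-k) * k ** (n - 1) / math.factorial(n - 1)

def numerical_derivative(k, n, h=1e-5):
    """Central-difference estimate of dR/dk."""
    return (probability_within(k + h, n) - probability_within(k - h, n)) / (2.0 * h)

if __name__ == "__main__":
    for n in (1, 2, 3):
        # The last two columns should agree to within the differencing error.
        print(n, probability_within(2.5, n),
              numerical_derivative(2.5, n), density_in_k(2.5, n))
```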

If we write K for the value of k which substituted in equation
(68) will give $R = \tfrac{1}{2}$, the phases determined by the equation

\[ F = K \tag{70} \]

will have the following properties.

The probability that the phase falls within the limits formed
by these phases is greater than the probability that it falls
within any other limits enclosing an equal extension-in-phase.
It is equal to the probability that the phase falls without the 
same limits. 
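
The value of k for which the probability R equals one half is not given in closed form for general n, but may be found by bisection, since R increases steadily with k. The sketch below is one way of doing so; the function names are hypothetical, and for a single degree of freedom the result should reduce to log 2, because R is then $1 - e^{-k}$.

```python
import math

def probability_within(k, n):
    """Probability R that F <= k, for n degrees of freedom."""
    partial = sum(k ** j / math.factorial(j) for j in range(n))
    return 1.0 - math.exp(-k) * partial

def median_k(n, tol=1e-12):
    """Value of k at which R = 1/2, located by bisection."""
    lo, hi = 0.0, 1.0
    while probability_within(hi, n) < 0.5:   # enlarge the bracket until R(hi) >= 1/2
        hi *= 2.0
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if probability_within(mid, n) < 0.5:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

if __name__ == "__main__":
    for n in (1, 2, 3):
        print(n, median_k(n))
```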

These properties are analogous to those which in the theory
of errors in the determination of a single quantity belong to
values expressed by $A \pm a$, when A is the most probable
value and a the ‘probable error.’


CHAPTER III.


APPLICATION OF THE PRINCIPLE OF CONSERVATION OF
EXTENSION-IN-PHASE TO THE INTEGRATION OF THE
DIFFERENTIAL EQUATIONS OF MOTION.*


We have seen that the principle of conservation of
extension-in-phase may be expressed as a differential relation
between the coordinates and momenta and the arbitrary constants
of the integral equations of motion. Now the integration of
the differential equations of motion consists in the
determination of these constants as functions of the coordinates and
momenta with the time, and the relation afforded by the
principle of conservation of extension-in-phase may assist us in
this determination.

It will be convenient to have a notation which shall not
distinguish between the coordinates and momenta. If we write
$r_1 \ldots r_{2n}$ for the coordinates and momenta, and a ... h as
before for the arbitrary constants, the principle of which we
wish to avail ourselves, and which is expressed by equation
(37), may be written

\[ \frac{d(r_1, \ldots r_{2n})}{d(a, \ldots h)} = \text{func.}(a, \ldots h). \tag{71} \]


* See Boltzmann: “Zusammenhang zwischen den Sätzen über das
Verhalten mehratomiger Gasmolecüle mit Jacobi’s Princip des letzten
Multiplicators.” Sitzb. der Wiener Akad., Bd. LXIII, Abth. II., S. 679, (1871).

Let us first consider the case in which the forces are
determined by the coordinates alone. Whether the forces are
‘conservative’ or not is immaterial. Since the differential
equations of motion do not contain the time (t) in the finite
form, if we eliminate dt from these equations, we obtain $2n - 1$
equations in $r_1, \ldots r_{2n}$ and their differentials, the integration
of which will introduce $2n - 1$ arbitrary constants which we
shall call b ... h. If we can effect these integrations, the
remaining constant (a) will then be introduced in the final
integration, (viz., that of an equation containing dt,) and will
be added to or subtracted from t in the integral equation.
Let us have it subtracted from t. It is evident then that

\[ \frac{dr_1}{da} = -\dot r_1, \qquad \frac{dr_2}{da} = -\dot r_2, \qquad \text{etc.} \tag{72} \]

Moreover, since b, ... h and t - a are independent functions
of $r_1, \ldots r_{2n}$, the latter variables are functions of the former.
The Jacobian in (71) is therefore function of b, ... h, and
t - a, and since it does not vary with t it cannot vary with a.
We have therefore in the case considered, viz., where the
forces are functions of the coordinates alone,

\[ \frac{d(r_1, \ldots r_{2n})}{d(a, \ldots h)} = \text{func.}(b, \ldots h). \tag{73} \]

Now let us suppose that of the first $2n - 1$ integrations we
have accomplished all but one, determining $2n - 2$ arbitrary
constants (say c, ... h) as functions of $r_1, \ldots r_{2n}$, leaving b as
well as a to be determined. Our $2n - 2$ finite equations
enable us to regard all the variables $r_1, \ldots r_{2n}$, and all functions
of these variables as functions of two of them, (say $r_1$ and $r_2$,)
with the arbitrary constants c, ... h. To determine b, we
have the following equations for constant values of c, ... h.

\[ dr_1 = \frac{dr_1}{da}\,da + \frac{dr_1}{db}\,db, \]

\[ dr_2 = \frac{dr_2}{da}\,da + \frac{dr_2}{db}\,db, \]

whence

\[ \frac{d(r_1, r_2)}{d(a, b)}\,db = -\frac{dr_2}{da}\,dr_1 + \frac{dr_1}{da}\,dr_2. \tag{74} \]


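Now, by the ordinary formula for the change of variables,

\[
\begin{aligned}
\int \cdots \int \frac{d(r_1, r_2)}{d(a, b)} \, da \, db \, dr_3 \ldots dr_{2n}
&= \int \cdots \int dr_1 \ldots dr_{2n} \\
&= \int \cdots \int \frac{d(r_1, \ldots r_{2n})}{d(a, \ldots h)} \, da \ldots dh \\
&= \int \cdots \int \frac{d(r_1, \ldots r_{2n})}{d(a, \ldots h)} \, \frac{d(c, \ldots h)}{d(r_3, \ldots r_{2n})} \, da \, db \, dr_3 \ldots dr_{2n},
\end{aligned}
\]

where the limits of the multiple integrals are formed by the
same phases. Hence

\[ \frac{d(r_1, r_2)}{d(a, b)} = \frac{d(r_1, \ldots r_{2n})}{d(a, \ldots h)} \, \frac{d(c, \ldots h)}{d(r_3, \ldots r_{2n})}. \tag{75} \]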


With the aid of this equation, which is an identity, and (72),
we may write equation (74) in the form

\[ \frac{d(r_1, \ldots r_{2n})}{d(a, \ldots h)} \, \frac{d(c, \ldots h)}{d(r_3, \ldots r_{2n})} \, db = \dot r_2 \, dr_1 - \dot r_1 \, dr_2. \tag{76} \]


The separation of the variables is now easy. The
differential equations of motion give $\dot r_1$ and $\dot r_2$ in terms of
$r_1, \ldots r_{2n}$. The integral equations already obtained give c, ... h
and therefore the Jacobian $d(c, \ldots h)/d(r_3, \ldots r_{2n})$, in terms of
the same variables. But in virtue of these same integral
equations, we may regard functions of $r_1, \ldots r_{2n}$ as functions
of $r_1$ and $r_2$ with the constants c, ... h. If therefore we write
the equation in the form

\[ \frac{d(r_1, \ldots r_{2n})}{d(a, \ldots h)} \, db = \frac{\dot r_2}{\dfrac{d(c, \ldots h)}{d(r_3, \ldots r_{2n})}} \, dr_1 - \frac{\dot r_1}{\dfrac{d(c, \ldots h)}{d(r_3, \ldots r_{2n})}} \, dr_2, \tag{77} \]


the coefficients of $dr_1$ and $dr_2$ may be regarded as known
functions of $r_1$ and $r_2$ with the constants c, ... h. The coefficient
of db is by (73) a function of b, ... h. It is not indeed a
known function of these quantities, but since c, ... h are
regarded as constant in the equation, we know that the first
member must represent the differential of some function of
b, ... h, for which we may write b'. We have thus

\[ db' = \frac{\dot r_2}{\dfrac{d(c, \ldots h)}{d(r_3, \ldots r_{2n})}} \, dr_1 - \frac{\dot r_1}{\dfrac{d(c, \ldots h)}{d(r_3, \ldots r_{2n})}} \, dr_2, \tag{78} \]

which may be integrated by quadratures and gives b' as
function of $r_1$, $r_2$, c, ... h, and thus as function of $r_1, \ldots r_{2n}$.
This integration gives us the last of the arbitrary constants
which are functions of the coordinates and momenta without
the time. The final integration, which introduces the
remaining constant (a), is also a quadrature, since the equation to
be integrated may be expressed in the form

\[ dt = F(r_1) \, dr_1. \]

Now, apart from any such considerations as have been
adduced, if we limit ourselves to the changes which take place
in time, we have identically

\[ \dot r_2 \, dr_1 - \dot r_1 \, dr_2 = 0, \]

and $\dot r_1$ and $\dot r_2$ are given in terms of $r_1, \ldots r_{2n}$ by the
differential equations of motion. When we have obtained $2n - 2$
integral equations, we may regard $\dot r_1$ and $\dot r_2$ as known
functions of $r_1$ and $r_2$. The only remaining difficulty is in
integrating this equation. If the case is so simple as to present
no difficulty, or if we have the skill or the good fortune to
perceive that the multiplier

\[ \frac{1}{\dfrac{d(c, \ldots h)}{d(r_3, \ldots r_{2n})}}, \tag{79} \]

or any other, will make the first member of the equation an
exact differential, we have no need of the rather lengthy
considerations which have been adduced. The utility of the
principle of conservation of extension-in-phase is that it
supplies a ‘multiplier’ which renders the equation integrable, and
which it might be difficult or impossible to find otherwise.
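
The quadrature may be illustrated in the simplest possible case. The sketch below assumes a single degree of freedom, with $r_1 = q$ and $r_2 = p$, so that no integrals c, ... h precede the last one and the multiplier is taken as unity; this reduction, the pendulum used as the example system, and the function names are all assumptions made for illustration. The expression $\dot r_2 \, dr_1 - \dot r_1 \, dr_2$ is then integrated along a straight path by Simpson's rule, and the result is compared with the negative of the energy, of which it should be the differential.

```python
import math

def q_dot(q, p):
    return p                      # dq/dt for H = p**2/2 - cos(q)

def p_dot(q, p):
    return -math.sin(q)           # dp/dt for the same H

def energy(q, p):
    return 0.5 * p * p - math.cos(q)

def b_prime(q1, p1, panels=200):
    """Integral of p_dot*dq - q_dot*dp along the straight path from (0, 0)
    to (q1, p1), by the composite Simpson rule (panels must be even)."""
    def integrand(s):
        q, p = s * q1, s * p1
        return p_dot(q, p) * q1 - q_dot(q, p) * p1
    h = 1.0 / panels
    total = integrand(0.0) + integrand(1.0)
    for i in range(1, panels):
        total += (4.0 if i % 2 else 2.0) * integrand(i * h)
    return total * h / 3.0

if __name__ == "__main__":
    q1, p1 = 1.2, 0.7
    # The quadrature should reproduce -(energy - energy at the origin).
    print(b_prime(q1, p1), -(energy(q1, p1) - energy(0.0, 0.0)))
```

The constant so obtained is a function of the phase without the time, as the general argument requires; with more degrees of freedom the multiplier would no longer be unity.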

It will be observed that the function represented by b' is a
particular case of that represented by b. The system of
arbitrary constants a, b', c ... h has certain properties notable for
simplicity. If we write b' for b in (77), and compare the
result with (78), we get

\[ \frac{d(r_1, \ldots r_{2n})}{d(a, b', c, \ldots h)} = 1. \tag{80} \]

Therefore the multiple integral

\[ \int \cdots \int da \, db' \, dc \ldots dh \tag{81} \]

taken within limits formed by phases regarded as
contemporaneous represents the extension-in-phase within those limits.

The case is somewhat different when the forces are not
determined by the coordinates alone, but are functions of the
coordinates with the time. All the arbitrary constants of the
integral equations must then be regarded in the general case
as functions of $r_1, \ldots r_{2n}$, and t. We cannot use the
principle of conservation of extension-in-phase until we have made
$2n - 1$ integrations. Let us suppose that the constants b, ... h
have been determined by integration in terms of $r_1, \ldots r_{2n}$, and
t, leaving a single constant (a) to be thus determined. Our
$2n - 1$ finite equations enable us to regard all the variables
$r_1, \ldots r_{2n}$ as functions of a single one, say $r_1$.

For constant values of b, ... h, we have

\[ dr_1 = \frac{dr_1}{da}\,da + \dot r_1 \, dt. \tag{82} \]


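Now

\[
\begin{aligned}
\int \cdots \int \frac{dr_1}{da} \, da \, dr_2 \ldots dr_{2n}
&= \int \cdots \int dr_1 \ldots dr_{2n} \\
&= \int \cdots \int \frac{d(r_1, \ldots r_{2n})}{d(a, \ldots h)} \, da \ldots dh \\
&= \int \cdots \int \frac{d(r_1, \ldots r_{2n})}{d(a, \ldots h)} \, \frac{d(b, \ldots h)}{d(r_2, \ldots r_{2n})} \, da \, dr_2 \ldots dr_{2n},
\end{aligned}
\]

where the limits of the integrals are formed by the same
phases. We have therefore

\[ \frac{dr_1}{da} = \frac{d(r_1, \ldots r_{2n})}{d(a, \ldots h)} \, \frac{d(b, \ldots h)}{d(r_2, \ldots r_{2n})}, \tag{83} \]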


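by which equation (82) may be reduced to the form

\[ \frac{d(r_1, \ldots r_{2n})}{d(a, \ldots h)} \, da = \frac{1}{\dfrac{d(b, \ldots h)}{d(r_2, \ldots r_{2n})}} \, dr_1 - \frac{\dot r_1}{\dfrac{d(b, \ldots h)}{d(r_2, \ldots r_{2n})}} \, dt. \tag{84} \]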


Now we know by (71) that the coefficient of da is a
function of a, ... h. Therefore, as b, ... h are regarded as constant
in the equation, the first member represents the differential
of a function of a, ... h, which we may denote by a'. We
have then

\[ da' = \frac{1}{\dfrac{d(b, \ldots h)}{d(r_2, \ldots r_{2n})}} \, dr_1 - \frac{\dot r_1}{\dfrac{d(b, \ldots h)}{d(r_2, \ldots r_{2n})}} \, dt, \tag{85} \]

which may be integrated by quadratures. In this case we
may say that the principle of conservation of
extension-in-phase has supplied the ‘multiplier’

\[ \frac{1}{\dfrac{d(b, \ldots h)}{d(r_2, \ldots r_{2n})}} \tag{86} \]

for the integration of the equation

\[ dr_1 - \dot r_1 \, dt = 0. \tag{87} \]

The system of arbitrary constants a', b, ... h has evidently
the same properties which were noticed in regard to the
system a, b', ... h.




CHAPTER IV. 


ON THE DISTRIBUTION IN PHASE CALLED CANONICAL,
IN WHICH THE INDEX OF PROBABILITY IS A LINEAR
FUNCTION OF THE ENERGY.


Let us now give our attention to the statistical equilibrium
of ensembles of conservative systems, especially to those cases
and properties which promise to throw light on the
phenomena of thermodynamics.

The condition of statistical equilibrium may be expressed
in the form*

\[ \sum \left( \frac{dP}{dp_1}\,\dot p_1 + \frac{dP}{dq_1}\,\dot q_1 \right) = 0, \tag{88} \]

where P is the coefficient of probability, or the quotient of
the density-in-phase by the whole number of systems. To
satisfy this condition, it is necessary and sufficient that P
should be a function of the p’s and q’s (the momenta and
coordinates) which does not vary with the time in a moving
system. In all cases which we are now considering, the
energy, or any function of the energy, is such a function.


\[ P = \text{func.}(\epsilon) \]

will therefore satisfy the equation, as indeed appears
identically if we write it in the form

\[ \sum \left( \frac{dP}{dq_1}\,\frac{d\epsilon}{dp_1} - \frac{dP}{dp_1}\,\frac{d\epsilon}{dq_1} \right) = 0. \]
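
That a function of the energy satisfies the condition of statistical equilibrium, while a function of the phase taken at random does not, may be checked numerically for one degree of freedom. The sketch below evaluates $\frac{dP}{dq}\frac{d\epsilon}{dp} - \frac{dP}{dp}\frac{d\epsilon}{dq}$ by central differences; the anharmonic energy function, the trial distributions and the function names are assumptions chosen for illustration.

```python
import math

def energy(q, p):
    """Assumed example: eps = p**2/2 + q**4/4."""
    return 0.5 * p * p + 0.25 * q ** 4

def function_of_energy(q, p, theta=1.5):
    """An (unnormalized) coefficient depending on the phase only through eps."""
    return math.exp(-energy(q, p) / theta)

def d_dq(f, q, p, h=1e-5):
    return (f(q + h, p) - f(q - h, p)) / (2.0 * h)

def d_dp(f, q, p, h=1e-5):
    return (f(q, p + h) - f(q, p - h)) / (2.0 * h)

def equilibrium_residual(prob, q, p):
    """dP/dq * d(eps)/dp - dP/dp * d(eps)/dq, which vanishes in equilibrium."""
    return (d_dq(prob, q, p) * d_dp(energy, q, p)
            - d_dp(prob, q, p) * d_dq(energy, q, p))

if __name__ == "__main__":
    print(equilibrium_residual(function_of_energy, 0.8, -1.1))       # near zero
    print(equilibrium_residual(lambda q, p: math.exp(-q * q), 0.8, -1.1))  # not zero
```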


* See equations (20), (41), (42), also the paragraph following equation (20).
The positions of any external bodies which can affect the systems are here
supposed uniform for all the systems and constant in time.

There are, however, other conditions to which P is subject,
which are not so much conditions of statistical equilibrium, as
conditions implicitly involved in the definition of the
coefficient of probability, whether the case is one of equilibrium
or not. These are: that P should be single-valued, and
neither negative nor imaginary for any phase, and that
expressed by equation (46), viz.,

\[ \int_{\text{phases}}^{\text{all}} \cdots \int P \, dp_1 \ldots dq_n = 1. \tag{89} \]

These considerations exclude

\[ P = \epsilon \times \text{constant}, \]

as well as

\[ P = \text{constant}, \]

as cases to be considered.


The distribution represented by

\[ \eta = \log P = \frac{\psi - \epsilon}{\Theta}, \tag{90} \]

or

\[ P = e^{\frac{\psi - \epsilon}{\Theta}}, \tag{91} \]

where $\Theta$ and $\psi$ are constants, and $\Theta$ positive, seems to
represent the most simple case conceivable, since it has the property
that when the system consists of parts with separate energies,
the laws of the distribution in phase of the separate parts are
of the same nature, a property which enormously simplifies
the discussion, and is the foundation of extremely important
relations to thermodynamics. The case is not rendered less
simple by the divisor $\Theta$, (a quantity of the same dimensions as
$\epsilon$,) but the reverse, since it makes the distribution independent
of the units employed. The negative sign of $\epsilon$ is required by
(89), which determines also the value of $\psi$ for any given
$\Theta$, viz.,

\[ e^{-\frac{\psi}{\Theta}} = \int_{\text{phases}}^{\text{all}} \cdots \int e^{-\frac{\epsilon}{\Theta}} \, dp_1 \ldots dq_n. \tag{92} \]



When an ensemble of systems is distributed in phase in the
manner described, i. e., when the index of probability is a
linear function of the energy, we shall say that the ensemble is
canonically distributed, and shall call the divisor of the energy
($\Theta$) the modulus of distribution.

The fractional part of an ensemble canonically distributed
which lies within any given limits of phase is therefore
represented by the multiple integral

\[ \int \cdots \int e^{\frac{\psi - \epsilon}{\Theta}} \, dp_1 \ldots dq_n \tag{93} \]

taken within those limits. We may express the same thing 
by saying that the multiple integral expresses the probability 
that an unspecified system of the ensemble (i. e., one of
which we only know that it belongs to the ensemble) falls 
within the given limits. 
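
As a concrete instance, the constant $\psi$ and the fraction of a canonical ensemble lying within given limits may be computed for a simple system. The sketch below assumes one degree of freedom with $\epsilon = (p^2 + q^2)/2$ (a harmonic oscillator in units where the mass and frequency are unity), fixes $\psi$ by requiring the coefficient of probability to integrate to unity, and then integrates that coefficient over a rectangle of phase. The system, the cutoff of the integration and the function names are assumptions for illustration; for this case the normalization gives $e^{-\psi/\Theta} = 2\pi\Theta$.

```python
import math

def energy(q, p):
    return 0.5 * (p * p + q * q)   # assumed example system

def phase_integral(f, q_lim, p_lim, cells=400):
    """Midpoint-rule integral of f(q, p) over |q| <= q_lim, |p| <= p_lim."""
    dq, dp = 2.0 * q_lim / cells, 2.0 * p_lim / cells
    total = 0.0
    for i in range(cells):
        q = -q_lim + (i + 0.5) * dq
        for j in range(cells):
            p = -p_lim + (j + 0.5) * dp
            total += f(q, p)
    return total * dq * dp

def psi(theta, cutoff=12.0):
    """psi fixed by normalization; `cutoff` stands in for infinite limits."""
    z = phase_integral(lambda q, p: math.exp(-energy(q, p) / theta), cutoff, cutoff)
    return -theta * math.log(z)

def fraction_within(theta, q_lim, p_lim):
    """Fraction of the canonical ensemble with |q| <= q_lim and |p| <= p_lim."""
    ps = psi(theta)
    return phase_integral(lambda q, p: math.exp((ps - energy(q, p)) / theta),
                          q_lim, p_lim)

if __name__ == "__main__":
    theta = 1.0
    print(psi(theta), -theta * math.log(2.0 * math.pi * theta))
    print(fraction_within(theta, 1.0, 1.0))
```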

Since the value of a multiple integral of the form (23)
(which we have called an extension-in-phase) bounded by any
given phases is independent of the system of coordinates by
which it is evaluated, the same must be true of the multiple
integral in (92), as appears at once if we divide up this
integral into parts so small that the exponential factor may be
regarded as constant in each. The value of $\psi$ is therefore
independent of the system of coordinates employed.

It is evident that $\psi$ might be defined as the energy for
which the coefficient of probability of phase has the value
unity. Since however this coefficient has the dimensions of
the inverse nth power of the product of energy and time,*
the energy represented by $\psi$ is not independent of the units
of energy and time. But when these units have been chosen,
the definition of $\psi$ will involve the same arbitrary constant as
$\epsilon$, so that, while in any given case the numerical values of
$\psi$ or $\epsilon$ will be entirely indefinite until the zero of energy has
also been fixed for the system considered, the difference $\psi - \epsilon$
will represent a perfectly definite amount of energy, which is
entirely independent of the zero of energy which we may
choose to adopt.

* See Chapter I, p. 19.

It is evident that the canonical distribution is entirely
determined by the modulus (considered as a quantity of energy)
and the nature of the system considered, since when equation
(92) is satisfied the value of the multiple integral (93) is
independent of the units and of the coordinates employed, and
of the zero chosen for the energy of the system.

In treating of the canonical distribution, we shall always
suppose the multiple integral in equation (92) to have a
finite value, as otherwise the coefficient of probability
vanishes, and the law of distribution becomes illusory. This will
exclude certain cases, but not such apparently, as will affect
the value of our results with respect to their bearing on
thermodynamics. It will exclude, for instance, cases in which the
system or parts of it can be distributed in unlimited space
(or in a space which has limits, but is still infinite in volume),
while the energy remains beneath a finite limit. It also
excludes many cases in which the energy can decrease without
limit, as when the system contains material points which
attract one another inversely as the squares of their distances.
Cases of material points attracting each other inversely as the
distances would be excluded for some values of $\Theta$, and not
for others. The investigation of such points is best left to
the particular cases. For the purposes of a general discussion,
it is sufficient to call attention to the assumption implicitly
involved in the formula (92).*

* It will be observed that similar limitations exist in thermodynamics. In
order that a mass of gas can be in thermodynamic equilibrium, it is necessary
that it be enclosed. There is no thermodynamic equilibrium of a (finite) mass
of gas in an infinite space. Again, that two attracting particles should be
able to do an infinite amount of work in passing from one configuration
(which is regarded as possible) to another, is a notion which, although
perfectly intelligible in a mathematical formula, is quite foreign to our ordinary
conceptions of matter.

The modulus $\Theta$ has properties analogous to those of
temperature in thermodynamics. Let the system A be defined as
one of an ensemble of systems of m degrees of freedom
distributed in phase with a probability-coefficient

\[ e^{\frac{\psi_A - \epsilon_A}{\Theta}}, \]

and the system B as one of an ensemble of systems of n
degrees of freedom distributed in phase with a
probability-coefficient

\[ e^{\frac{\psi_B - \epsilon_B}{\Theta}}, \]

which has the same modulus. Let $q_1, \ldots q_m$, $p_1, \ldots p_m$ be the
coordinates and momenta of A, and $q_{m+1}, \ldots q_{m+n}$,
$p_{m+1}, \ldots p_{m+n}$ those of B. Now we may regard the systems A
and B as together forming a system C, having m + n degrees of
freedom, and the coordinates and momenta $q_1, \ldots q_{m+n}$,
$p_1, \ldots p_{m+n}$. The probability that the phase of the system C,
as thus defined, will fall within the limits

\[ dp_1, \ldots dp_{m+n}, \; dq_1, \ldots dq_{m+n} \]

is evidently the product of the probabilities that the systems
A and B will each fall within the specified limits, viz.,

\[ e^{\frac{\psi_A + \psi_B - \epsilon_A - \epsilon_B}{\Theta}} \, dp_1 \ldots dp_{m+n} \, dq_1 \ldots dq_{m+n}. \tag{94} \]


We may therefore regard C as an undetermined system of an
ensemble distributed with the probability-coefficient

\[ e^{\frac{\psi_A + \psi_B - (\epsilon_A + \epsilon_B)}{\Theta}}, \tag{95} \]

an ensemble which might be defined as formed by combining
each system of the first ensemble with each of the second.
But since $\epsilon_A + \epsilon_B$ is the energy of the whole system, and
$\psi_A$ and $\psi_B$ are constants, the probability-coefficient is of the
general form which we are considering, and the ensemble to
which it relates is in statistical equilibrium and is canonically
distributed.

This result, however, so far as statistical equilibrium is
concerned, is rather nugatory, since conceiving of separate
systems as forming a single system does not create any
interaction between them, and if the systems combined belong to
ensembles in statistical equilibrium, to say that the ensemble
formed by such combinations as we have supposed is in
statistical equilibrium, is only to repeat the data in different
words. Let us therefore suppose that in forming the system
C we add certain forces acting between A and B, and having
the force-function $-\epsilon_{AB}$. The energy of the system C is now
$\epsilon_A + \epsilon_B + \epsilon_{AB}$, and an ensemble of such systems distributed
with a density proportional to

\[ e^{-\frac{\epsilon_A + \epsilon_B + \epsilon_{AB}}{\Theta}} \tag{96} \]


would be in statistical equilibrium. Comparing this with the
probability-coefficient of C given above (95), we see that if
we suppose $\epsilon_{AB}$ (or rather the variable part of this term when
we consider all possible configurations of the systems A and B)
to be infinitely small, the actual distribution in phase of C
will differ infinitely little from one of statistical equilibrium,
which is equivalent to saying that its distribution in phase
will vary infinitely little even in a time indefinitely prolonged.*
The case would be entirely different if A and B belonged to
ensembles having different moduli, say $\Theta_A$ and $\Theta_B$. The
probability-coefficient of C would then be

\[ e^{\frac{\psi_A - \epsilon_A}{\Theta_A} + \frac{\psi_B - \epsilon_B}{\Theta_B}}, \tag{97} \]


which is not approximately proportional to any expression of 
the form (96). 
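The contrast between equal and unequal moduli can be checked
numerically. The following Python sketch is an illustration only:
the function names and the numerical values are assumptions made
here, and the constants $\psi_1$, $\psi_2$ are set to zero because they do
not affect the comparison. It asks whether the coefficient of the
compound system is the same for every way of dividing a fixed total
energy between the two components.

```python
import math

def combined_coefficient(e1, e2, theta1, theta2, psi1=0.0, psi2=0.0):
    """Probability-coefficient of the compound system for component energies e1, e2."""
    return math.exp((psi1 - e1) / theta1 + (psi2 - e2) / theta2)

def depends_only_on_total_energy(theta1, theta2, total=3.0, splits=(0.5, 1.0, 2.5)):
    """True if every division of the same total energy gives the same coefficient."""
    values = [combined_coefficient(e1, total - e1, theta1, theta2) for e1 in splits]
    return all(math.isclose(v, values[0], rel_tol=1e-12) for v in values)

print(depends_only_on_total_energy(2.0, 2.0))  # equal moduli
print(depends_only_on_total_energy(2.0, 3.0))  # unequal moduli
```

With equal moduli the coefficient is a function of the total energy
alone; with unequal moduli it changes with the division of energy
between the components, so it cannot be of the canonical form for
the compound system.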

* It will be observed that the above condition relating to the forces which
act between the different systems is entirely analogous to that which must
hold in the corresponding case in thermodynamics. The most simple test
of the equality of temperature of two bodies is that they remain in
equilibrium when brought into thermal contact. Direct thermal contact implies
molecular forces acting between the bodies. Now the test will fail unless
the energy of these forces can be neglected in comparison with the other
energies of the bodies. Thus, in the case of energetic chemical action
between the bodies, or when the number of particles affected by the forces
acting between the bodies is not negligible in comparison with the whole
number of particles (as when the bodies have the form of exceedingly thin
sheets), the contact of bodies of the same temperature may produce
considerable thermal disturbance, and thus fail to afford a reliable criterion of
the equality of temperature.

Before proceeding farther in the investigation of the
distribution in phase which we have called canonical, it will be
interesting to see whether the properties with respect to
statistical equilibrium which have been described are peculiar
to it, or whether other distributions may have analogous
properties.

Let $\eta'$ and $\eta''$ be the indices of probability in two
independent ensembles which are each in statistical equilibrium, then
$\eta' + \eta''$ will be the index in the ensemble obtained by
combining each system of the first ensemble with each system of the
second. This third ensemble will of course be in statistical
equilibrium, and the function of phase $\eta' + \eta''$ will be a
constant of motion. Now when infinitesimal forces are added to
the compound systems, if $\eta' + \eta''$ or a function differing
infinitesimally from this is still a constant of motion, it must
be on account of the nature of the forces added, or if their action
is not entirely specified, on account of conditions to which
they are subject. Thus, in the case already considered,
$\eta' + \eta''$ is a function of the energy of the compound system,
and the infinitesimal forces added are subject to the law of
conservation of energy.

Another natural supposition in regard to the added forces 
is that they should be such as not to affect the moments of 
momentum of the compound system. To get a case in which 
moments of momentum of the compound system shall be 
constants of motion, we may imagine material particles
contained in two concentric spherical shells, being prevented from
passing the surfaces bounding the shells by repulsions acting 
always in lines passing through the common centre of the 
shells. Then, if there are no forces acting between particles in 
different shells, the mass of particles in each shell will have, 
besides its energy, the moments of momentum about three 
axes through the centre as constants of motion. 

Now let us imagine an ensemble formed by distributing in
phase the system of particles in one shell according to the
index of probability

$$A - \frac{\epsilon}{\Theta} + \frac{\omega_1}{\Omega_1} + \frac{\omega_2}{\Omega_2} + \frac{\omega_3}{\Omega_3}, \tag{98}$$

where $\epsilon$ denotes the energy of the system, and $\omega_1$, $\omega_2$, $\omega_3$ its
three moments of momentum, and the other letters constants.
In like manner let us imagine a second ensemble formed by
distributing in phase the system of particles in the other shell
according to the index

$$A' - \frac{\epsilon'}{\Theta} + \frac{\omega_1'}{\Omega_1} + \frac{\omega_2'}{\Omega_2} + \frac{\omega_3'}{\Omega_3}, \tag{99}$$

where the letters have similar significations, and $\Theta$, $\Omega_1$, $\Omega_2$, $\Omega_3$
the same values as in the preceding formula. Each of the
two ensembles will evidently be in statistical equilibrium, and
therefore also the ensemble of compound systems obtained by
combining each system of the first ensemble with each of the
second. In this third ensemble the index of probability will be


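$$A + A' - \frac{\epsilon + \epsilon'}{\Theta} + \frac{\omega_1 + \omega_1'}{\Omega_1} + \frac{\omega_2 + \omega_2'}{\Omega_2} + \frac{\omega_3 + \omega_3'}{\Omega_3}, \tag{100}$$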


where the four numerators represent functions of phase which 
are constants of motion for the compound systems. 

Now if we add in each system of this third ensemble
infinitesimal conservative forces of attraction or repulsion between
particles in different shells, determined by the same law for
all the systems, the functions $\omega_1 + \omega_1'$, $\omega_2 + \omega_2'$, and $\omega_3 + \omega_3'$
will remain constants of motion, and a function differing
infinitely little from $\epsilon + \epsilon'$ will be a constant of motion. It
would therefore require only an infinitesimal change in the
distribution in phase of the ensemble of compound systems to
make it a case of statistical equilibrium. These properties are
entirely analogous to those of canonical ensembles.*

* It would not be possible to omit the term relating to energy in the above
indices, since without this term the condition expressed by equation (89)
cannot be satisfied.

The consideration of the above case of statistical equilibrium may be
made the foundation of the theory of the thermodynamic equilibrium of
rotating bodies, a subject which has been treated by Maxwell in his memoir
"On Boltzmann's theorem on the average distribution of energy in a system
of material points." Cambr. Phil. Trans., vol. XII, p. 547, (1878).

Again, if the relations between the forces and the coordinates
can be expressed by linear equations, there will be certain
"normal" types of vibration of which the actual motion may
be regarded as composed, and the whole energy may be divided
into parts relating separately to vibrations of these different
types. These partial energies will be constants of motion,
and if such a system is distributed according to an index
which is any function of the partial energies, the ensemble will
be in statistical equilibrium. Let the index be a linear
function of the partial energies, say

$$A - \frac{\epsilon_1}{\Theta_1} - \cdots - \frac{\epsilon_n}{\Theta_n}. \tag{101}$$


Let us suppose that we have also a second ensemble
composed of systems in which the forces are linear functions of
the coordinates, and distributed in phase according to an index
which is a linear function of the partial energies relating to
the normal types of vibration, say

$$A' - \frac{\epsilon_1'}{\Theta_1'} - \cdots - \frac{\epsilon_m'}{\Theta_m'}. \tag{102}$$


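Since the two ensembles are both in statistical equilibrium,
the ensemble formed by combining each system of the first
with each system of the second will also be in statistical
equilibrium. Its distribution in phase will be represented by
the index

$$A + A' - \frac{\epsilon_1}{\Theta_1} - \cdots - \frac{\epsilon_n}{\Theta_n} - \frac{\epsilon_1'}{\Theta_1'} - \cdots - \frac{\epsilon_m'}{\Theta_m'}, \tag{103}$$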


and the partial energies represented by the numerators in the 
formula will be constants of motion of the compound systems 
which form this third ensemble. 

Now if we add to these compound systems infinitesimal
forces acting between the component systems and subject to
the same general law as those already existing, viz., that they
are conservative and linear functions of the coordinates, there
will still be $n + m$ types of normal vibration, and $n + m$
partial energies which are independent constants of motion.
If all the original $n + m$ normal types of vibration have
different periods, the new types of normal vibration will differ
infinitesimally from the old, and the new partial energies, which are
constants of motion, will be nearly the same functions of
phase as the old. Therefore the distribution in phase of the
ensemble of compound systems after the addition of the
supposed infinitesimal forces will differ infinitesimally from one
which would be in statistical equilibrium.

The case is not so simple when some of the normal types of
motion have the same periods. In this case the addition of
infinitesimal forces may completely change the normal types
of motion. But the sum of the partial energies for all the
original types of vibration which have any same period, will
be nearly identical (as a function of phase, i.e., of the
coordinates and momenta,) with the sum of the partial energies for
the normal types of vibration which have the same, or nearly
the same, period after the addition of the new forces. If,
therefore, the partial energies in the indices of the first two
ensembles (101) and (102) which relate to types of vibration
having the same periods, have the same divisors, the same will
be true of the index (103) of the ensemble of compound
systems, and the distribution represented will differ infinitesimally
from one which would be in statistical equilibrium after the
addition of the new forces.*

* It is interesting to compare the above relations with the laws respecting
the exchange of energy between bodies by radiation, although the phenomena
of radiations lie entirely without the scope of the present treatise, in which
the discussion is limited to systems of a finite number of degrees of freedom.

The same would be true if in the indices of each of the
original ensembles we should substitute for the term or terms
relating to any period which does not occur in the other
ensemble, any function of the total energy related to that period,
subject only to the general limitation expressed by equation
(89). But in order that the ensemble of compound systems
(with the added forces) shall always be approximately in
statistical equilibrium, it is necessary that the indices of the
original ensembles should be linear functions of those partial
energies which relate to vibrations of periods common to the
two ensembles, and that the coefficients of such partial
energies should be the same in the two indices.†

† The above may perhaps be sufficiently illustrated by the simple case
where $n = 1$ in each system. If the periods are different in the two systems,
they may be distributed according to any functions of the energies: but if
the periods are the same they must be distributed canonically with same
modulus in order that the compound ensemble with additional forces may
be in statistical equilibrium.

The properties of canonically distributed ensembles of
systems with respect to the equilibrium of the new ensembles
which may be formed by combining each system of one
ensemble with each system of another, are therefore not peculiar
to them in the sense that analogous properties do not belong
to some other distributions under special limitations in regard
to the systems and forces considered. Yet the canonical
distribution evidently constitutes the most simple case of the
kind, and that for which the relations described hold with the
least restrictions.

Returning to the case of the canonical distribution, we
shall find other analogies with thermodynamic systems, if we
suppose, as in the preceding chapters,* that the potential
energy ($\epsilon_q$) depends not only upon the coordinates $q_1, \ldots q_n$
which determine the configuration of the system, but also
upon certain coordinates $a_1$, $a_2$, etc. of bodies which we call
external, meaning by this simply that they are not to be
regarded as forming any part of the system, although their
positions affect the forces which act on the system. The
forces exerted by the system upon these external bodies will
be represented by $-d\epsilon_q/da_1$, $-d\epsilon_q/da_2$, etc., while
$-d\epsilon_q/dq_1, \ldots -d\epsilon_q/dq_n$ represent all the forces acting upon the bodies
of the system, including those which depend upon the position
of the external bodies, as well as those which depend only
upon the configuration of the system itself. It will be
understood that $\epsilon_p$ depends only upon $q_1, \ldots q_n$, $p_1, \ldots p_n$, in other
words, that the kinetic energy of the bodies which we call
external forms no part of the kinetic energy of the system.
It follows that we may write

$$\frac{d\epsilon}{da_1} = \frac{d\epsilon_q}{da_1} = -A_1, \tag{104}$$

although a similar equation would not hold for differentiations
relative to the internal coordinates.

* See especially Chapter I, p. 4.

We always suppose these external coordinates to have the
same values for all systems of any ensemble. In the case of
a canonical distribution, i.e., when the index of probability
of phase is a linear function of the energy, it is evident that
the values of the external coordinates will affect the
distribution, since they affect the energy. In the equation

$$e^{-\frac{\psi}{\Theta}} = \int\!\cdots\!\int_{\text{all phases}} e^{-\frac{\epsilon}{\Theta}}\, dp_1 \ldots dq_n, \tag{105}$$

by which $\psi$ may be determined, the external coordinates, $a_1$, $a_2$,
etc., contained implicitly in $\epsilon$, as well as $\Theta$, are to be
regarded as constant in the integrations indicated. The
equation indicates that $\psi$ is a function of these constants. If we
imagine their values varied, and the ensemble distributed
canonically according to their new values, we have by
differentiation of the equation

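$$e^{-\frac{\psi}{\Theta}} \left( -\frac{1}{\Theta}\, d\psi + \frac{\psi}{\Theta^2}\, d\Theta \right) = \frac{1}{\Theta^2}\, d\Theta \int\!\cdots\!\int_{\text{all phases}} \epsilon\, e^{-\frac{\epsilon}{\Theta}}\, dp_1 \ldots dq_n$$
$$\qquad - \frac{1}{\Theta}\, da_1 \int\!\cdots\!\int_{\text{all phases}} \frac{d\epsilon}{da_1}\, e^{-\frac{\epsilon}{\Theta}}\, dp_1 \ldots dq_n - \text{etc.}, \tag{106}$$

or, multiplying by $\Theta\, e^{\frac{\psi}{\Theta}}$, and setting

$$-\frac{d\epsilon}{da_1} = A_1, \quad -\frac{d\epsilon}{da_2} = A_2, \quad \text{etc.},$$

$$-d\psi + \frac{\psi}{\Theta}\, d\Theta = \frac{1}{\Theta}\, d\Theta \int\!\cdots\!\int_{\text{all phases}} \epsilon\, e^{\frac{\psi - \epsilon}{\Theta}}\, dp_1 \ldots dq_n$$
$$\qquad + da_1 \int\!\cdots\!\int_{\text{all phases}} A_1\, e^{\frac{\psi - \epsilon}{\Theta}}\, dp_1 \ldots dq_n + da_2 \int\!\cdots\!\int_{\text{all phases}} A_2\, e^{\frac{\psi - \epsilon}{\Theta}}\, dp_1 \ldots dq_n + \text{etc.} \tag{107}$$

Now the average value in the ensemble of any quantity
(which we shall denote in general by a horizontal line above
the proper symbol) is determined by the equation

$$\bar{u} = \int\!\cdots\!\int_{\text{all phases}} u\, e^{\frac{\psi - \epsilon}{\Theta}}\, dp_1 \ldots dq_n. \tag{108}$$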


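Comparing this with the preceding equation, we have

$$d\psi = \frac{\psi - \bar\epsilon}{\Theta}\, d\Theta - \bar{A}_1\, da_1 - \bar{A}_2\, da_2 - \text{etc.} \tag{109}$$

Or, since

$$\frac{\psi - \epsilon}{\Theta} = \eta, \tag{110}$$

and

$$\frac{\psi - \bar\epsilon}{\Theta} = \bar\eta, \tag{111}$$

$$d\psi = \bar\eta\, d\Theta - \bar{A}_1\, da_1 - \bar{A}_2\, da_2 - \text{etc.} \tag{112}$$

Moreover, since (111) gives

$$d\psi - d\bar\epsilon = \Theta\, d\bar\eta + \bar\eta\, d\Theta, \tag{113}$$

we have also

$$d\bar\epsilon = -\Theta\, d\bar\eta - \bar{A}_1\, da_1 - \bar{A}_2\, da_2 - \text{etc.} \tag{114}$$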

This equation, if we neglect the sign of averages, is
identical in form with the thermodynamic equation

$$d\eta = \frac{d\epsilon + A_1\, da_1 + A_2\, da_2 + \text{etc.}}{T}, \tag{115}$$

or

$$d\epsilon = T\, d\eta - A_1\, da_1 - A_2\, da_2 - \text{etc.}, \tag{116}$$

which expresses the relation between the energy,
temperature, and entropy of a body in thermodynamic equilibrium,
and the forces which it exerts on external bodies, a relation
which is the mathematical expression of the second law of
thermodynamics for reversible changes. The modulus in the
statistical equation corresponds to temperature in the
thermodynamic equation, and the average index of probability with
its sign reversed corresponds to entropy. But in the
thermodynamic equation the entropy ($\eta$) is a quantity which is
only defined by the equation itself, and incompletely defined
in that the equation only determines its differential, and the
constant of integration is arbitrary. On the other hand, the
$\bar\eta$ in the statistical equation has been completely defined as
the average value in a canonical ensemble of systems of
the logarithm of the coefficient of probability of phase.

We may also compare equation (112) with the
thermodynamic equation

$$d\psi = -\eta\, dT - A_1\, da_1 - A_2\, da_2 - \text{etc.}, \tag{117}$$

where $\psi$ represents the function obtained by subtracting the
product of the temperature and entropy from the energy.
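Equation (112) can be checked on a case simple enough to integrate
by hand. The Python sketch below is an illustration only, under
assumptions made here: a single degree of freedom with energy
$p^2/2M + a q^2/2$, in which the stiffness $a$ plays the part of an
external coordinate, so that $e^{-\psi/\Theta} = 2\pi\Theta\sqrt{M/a}$, the average
energy is $\Theta$, and the average of $A = -d\epsilon/da = -q^2/2$ is
$-\Theta/2a$. The function names and numerical values are hypothetical.

```python
import math

M = 1.0  # mass (arbitrary illustrative value)

def psi(theta, a):
    """psi from e^{-psi/theta} = integral of e^{-eps/theta} dp dq = 2*pi*theta*sqrt(M/a)."""
    return -theta * math.log(2.0 * math.pi * theta * math.sqrt(M / a))

def mean_index(theta, a):
    """Average index of probability (psi - mean energy)/theta; the mean energy is theta."""
    return (psi(theta, a) - theta) / theta

def mean_force(theta, a):
    """Average of A = -d(eps)/da = -q^2/2; the canonical average of q^2 is theta/a."""
    return -theta / (2.0 * a)

def residuals(theta=1.3, a=0.7, h=1e-6):
    """Central differences of psi compared with the coefficients in equation (112)."""
    dpsi_dtheta = (psi(theta + h, a) - psi(theta - h, a)) / (2.0 * h)
    dpsi_da = (psi(theta, a + h) - psi(theta, a - h)) / (2.0 * h)
    return dpsi_dtheta - mean_index(theta, a), dpsi_da + mean_force(theta, a)

print(residuals())  # both entries should be close to zero
```

Both residuals vanish to within the accuracy of the finite
differences, as (112) requires: the coefficient of $d\Theta$ is the average
index, and the coefficient of $da$ is the average force with its sign
reversed.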

How far, or in what sense, the similarity of these equations
constitutes any demonstration of the thermodynamic
equations, or accounts for the behavior of material systems, as
described in the theorems of thermodynamics, is a question
of which we shall postpone the consideration until we have
further investigated the properties of an ensemble of systems
distributed in phase according to the law which we are
considering. The analogies which have been pointed out will at
least supply the motive for this investigation, which will
naturally commence with the determination of the average
values in the ensemble of the most important quantities relating
to the systems, and to the distribution of the ensemble with
respect to the different values of these quantities.



CHAPTER V. 


AVERAGE VALUES IN A CANONICAL ENSEMBLE 
OF SYSTEMS. 


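In the simple but important case of a system of material
points, if we use rectangular coordinates, we have for the
product of the differentials of the coordinates

$$dx_1\, dy_1\, dz_1 \ldots dx_\nu\, dy_\nu\, dz_\nu,$$

and for the product of the differentials of the momenta

$$m_1\, d\dot{x}_1\; m_1\, d\dot{y}_1\; m_1\, d\dot{z}_1 \ldots m_\nu\, d\dot{x}_\nu\; m_\nu\, d\dot{y}_\nu\; m_\nu\, d\dot{z}_\nu.$$

The product of these expressions, which represents an element
of extension-in-phase, may be briefly written

$$m_1\, d\dot{x}_1 \ldots m_\nu\, d\dot{z}_\nu\; dx_1 \ldots dz_\nu;$$

and the integral

$$\int\!\cdots\!\int e^{\frac{\psi - \epsilon}{\Theta}}\, m_1\, d\dot{x}_1 \ldots m_\nu\, d\dot{z}_\nu\; dx_1 \ldots dz_\nu \tag{118}$$

will represent the probability that a system taken at random
from an ensemble canonically distributed will fall within any
given limits of phase.

In this case

$$\epsilon = \epsilon_q + \tfrac{1}{2} m_1 \dot{x}_1^2 \ldots + \tfrac{1}{2} m_\nu \dot{z}_\nu^2, \tag{119}$$

and

$$e^{\frac{\psi - \epsilon}{\Theta}} = e^{\frac{\psi - \epsilon_q}{\Theta}}\, e^{-\frac{m_1 \dot{x}_1^2}{2\Theta}} \ldots e^{-\frac{m_\nu \dot{z}_\nu^2}{2\Theta}}. \tag{120}$$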


The potential energy ($\epsilon_q$) is independent of the velocities,
and if the limits of integration for the coordinates are
independent of the velocities, and the limits of the several
velocities are independent of each other as well as of the coordinates,
the multiple integral may be resolved into the product of
integrals

$$\int\!\cdots\!\int e^{\frac{\psi - \epsilon_q}{\Theta}}\, dx_1 \ldots dz_\nu \int e^{-\frac{m_1 \dot{x}_1^2}{2\Theta}}\, m_1\, d\dot{x}_1 \ldots \int e^{-\frac{m_\nu \dot{z}_\nu^2}{2\Theta}}\, m_\nu\, d\dot{z}_\nu. \tag{121}$$

This shows that the probability that the configuration lies
within any given limits is independent of the velocities,
and that the probability that any component velocity lies
within any given limits is independent of the other component
velocities and of the configuration.

Since

$$\int_{-\infty}^{+\infty} e^{-\frac{m_1 \dot{x}_1^2}{2\Theta}}\, m_1\, d\dot{x}_1 = \sqrt{2\pi m_1 \Theta}, \tag{122}$$

and

$$\int_{-\infty}^{+\infty} \tfrac{1}{2} m_1 \dot{x}_1^2\, e^{-\frac{m_1 \dot{x}_1^2}{2\Theta}}\, m_1\, d\dot{x}_1 = \sqrt{\tfrac{1}{2} \pi m_1 \Theta^3}, \tag{123}$$

the average value of the part of the kinetic energy due to the
velocity $\dot{x}_1$, which is expressed by the quotient of these
integrals, is $\tfrac{1}{2}\Theta$. This is true whether the average is taken for
the whole ensemble or for any particular configuration,
whether it is taken without reference to the other component
velocities, or only those systems are considered in which the
other component velocities have particular values or lie
within specified limits.

The number of coordinates is $3\nu$ or $n$. We have, therefore,
for the average value of the kinetic energy of a system

$$\bar\epsilon_p = \tfrac{3}{2}\, \nu\, \Theta = \tfrac{1}{2}\, n\, \Theta. \tag{124}$$

This is equally true whether we take the average for the whole 
ensemble, or limit the average to a single configuration. 
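The quotient of the two integrals over a single component velocity
can be confirmed by direct quadrature. The following Python sketch is
an illustration only; the function name, the midpoint rule, the
cut-off of twelve standard deviations and the numerical values are
choices made here, not part of the text.

```python
import math

def gaussian_moment_ratio(mass, theta, n_steps=20000, width=12.0):
    """Midpoint-rule values of the kinetic-energy quotient and of the normalizing integral.

    The weight is exp(-m v^2 / (2 theta)) m dv, integrated over one component velocity v.
    """
    v_max = width * math.sqrt(theta / mass)  # cut-off, far into the Gaussian tail
    dv = 2.0 * v_max / n_steps
    num = den = 0.0
    for i in range(n_steps):
        v = -v_max + (i + 0.5) * dv
        w = math.exp(-mass * v * v / (2.0 * theta)) * mass * dv
        num += 0.5 * mass * v * v * w  # integrand of the kinetic-energy integral
        den += w                       # integrand of the normalizing integral
    return num / den, den

ratio, norm = gaussian_moment_ratio(mass=2.5, theta=0.8)
print(ratio, norm)  # ratio should be close to theta/2, norm to sqrt(2 pi m theta)
```

The quotient does not depend on the mass, so each of the $3\nu$
component velocities contributes the same average energy, and the
total follows by addition.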

The distribution of the systems with respect to their
component velocities follows the 'law of errors'; the probability
that the value of any component velocity lies within any given
limits being represented by the value of the corresponding
integral in (121) for those limits, divided by $(2\pi m\Theta)^{1/2}$,
which is the value of the same integral for infinite limits.
Thus the probability that the value of $\dot{x}_1$ lies between any
given limits is expressed by

$$\left( \frac{m_1}{2\pi\Theta} \right)^{1/2} \int e^{-\frac{m_1 \dot{x}_1^2}{2\Theta}}\, d\dot{x}_1. \tag{125}$$

The expression becomes more simple when the velocity is
expressed with reference to the energy involved. If we set

$$s = \left( \frac{m_1}{2\Theta} \right)^{1/2} \dot{x}_1,$$

the probability that $s$ lies between any given limits is
expressed by

$$\frac{1}{\sqrt{\pi}} \int e^{-s^2}\, ds. \tag{126}$$

Here $s$ is the ratio of the component velocity to that which
would give the energy $\Theta$; in other words, $s^2$ is the quotient
of the energy due to the component velocity divided by $\Theta$.
The distribution with respect to the partial energies due to
the component velocities is therefore the same for all the
component velocities.
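The probability that $s$ lies between given limits is the integral of
$e^{-s^2}/\sqrt{\pi}$ between those limits, which is half the difference of
the error function at the two limits. The Python sketch below is an
illustration only; the function names and numerical values are
assumptions made here, with $s = \dot{x}\sqrt{m/2\Theta}$ as defined above.

```python
import math

def prob_s_between(s_lo, s_hi):
    """(1/sqrt(pi)) times the integral of exp(-s^2) ds from s_lo to s_hi."""
    return 0.5 * (math.erf(s_hi) - math.erf(s_lo))

def prob_velocity_between(v_lo, v_hi, mass, theta):
    """The same probability stated for a component velocity, using s = v * sqrt(m / (2 theta))."""
    scale = math.sqrt(mass / (2.0 * theta))
    return prob_s_between(scale * v_lo, scale * v_hi)

# Probability that the energy of one component velocity is less than theta,
# for two different masses: the limits in s are -1 and +1 in both cases.
theta = 0.8
for mass in (0.5, 40.0):
    v = math.sqrt(2.0 * theta / mass)  # velocity which would give the energy theta
    print(mass, prob_velocity_between(-v, v, mass, theta))
```

Both masses give the same probability, which illustrates the
statement that the distribution with respect to the partial energies
is the same for all the component velocities.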

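The probability that the configuration lies within any given
limits is expressed by the value of

$$M^{\frac{3}{2}} (2\pi\Theta)^{\frac{3\nu}{2}} \int\!\cdots\!\int e^{\frac{\psi - \epsilon_q}{\Theta}}\, dx_1 \ldots dz_\nu \tag{127}$$

for those limits, where $M$ denotes the product of all the
masses. This is derived from (121) by substitution of the
values of the integrals relating to velocities taken for infinite
limits.

Very similar results may be obtained in the general case of
a conservative system of $n$ degrees of freedom. Since $\epsilon_p$ is a
homogeneous quadratic function of the $p$'s, it may be divided
into parts by the formula

$$\epsilon_p = \tfrac{1}{2}\, p_1 \frac{d\epsilon_p}{dp_1} \ldots + \tfrac{1}{2}\, p_n \frac{d\epsilon_p}{dp_n}, \tag{128}$$

where $\epsilon$ might be written for $\epsilon_p$ in the differential coefficients
without affecting the signification. The average value of the
first of these parts, for any given configuration, is expressed
by the quotient

$$\frac{\displaystyle \int_{-\infty}^{+\infty}\!\cdots\!\int_{-\infty}^{+\infty} \tfrac{1}{2}\, p_1 \frac{d\epsilon}{dp_1}\, e^{\frac{\psi - \epsilon}{\Theta}}\, dp_1 \ldots dp_n}{\displaystyle \int_{-\infty}^{+\infty}\!\cdots\!\int_{-\infty}^{+\infty} e^{\frac{\psi - \epsilon}{\Theta}}\, dp_1 \ldots dp_n}.$$

Now we have by integration by parts

$$\int_{-\infty}^{+\infty} p_1 \frac{d\epsilon}{dp_1}\, e^{\frac{\psi - \epsilon}{\Theta}}\, dp_1 = \Theta \int_{-\infty}^{+\infty} e^{\frac{\psi - \epsilon}{\Theta}}\, dp_1.$$

By substitution of this value, the above quotient reduces to
$\tfrac{1}{2}\Theta$, which is therefore the average value of $\tfrac{1}{2}\, p_1\, d\epsilon/dp_1$ for the
given configuration. Since this value is independent of the
configuration, it must also be the average for the whole
ensemble, as might easily be proved directly. (To make
the preceding proof apply directly to the whole ensemble, we
have only to write $dp_1 \ldots dq_n$ for $dp_1 \ldots dp_n$ in the multiple
integrals.) This gives $\tfrac{1}{2} n \Theta$ for the average value of the
whole kinetic energy for any given configuration, or for
the whole ensemble, as has already been proved in the case of
material points.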
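The result does not require the kinetic energy to be a sum of
separate squares. The following Python sketch is an illustration
only, under assumptions made here: two degrees of freedom with
$\epsilon_p = \tfrac{1}{2}(a p_1^2 + 2 b p_1 p_2 + c p_2^2)$ at a fixed configuration, with
constant coefficients $a$, $b$, $c$ chosen so that the form is positive,
and a midpoint rule on a finite grid in place of the infinite
limits. The function name and numerical values are hypothetical.

```python
import math

def mean_partial_kinetic_energy(a, b, c, theta, n=400, width=10.0):
    """Average of (1/2) p1 d(eps_p)/d(p1) for eps_p = (a p1^2 + 2 b p1 p2 + c p2^2)/2."""
    # Assumed: a > 0 and a*c - b*b > 0, so that eps_p cannot be negative.
    det = a * c - b * b
    p1_max = width * math.sqrt(theta * c / det)  # `width` standard deviations of p1
    p2_max = width * math.sqrt(theta * a / det)  # `width` standard deviations of p2
    d1, d2 = 2.0 * p1_max / n, 2.0 * p2_max / n
    num = den = 0.0
    for i in range(n):
        p1 = -p1_max + (i + 0.5) * d1
        for j in range(n):
            p2 = -p2_max + (j + 0.5) * d2
            eps = 0.5 * (a * p1 * p1 + 2.0 * b * p1 * p2 + c * p2 * p2)
            w = math.exp(-eps / theta)
            num += 0.5 * p1 * (a * p1 + b * p2) * w  # (1/2) p1 * d(eps)/d(p1)
            den += w
    return num / den

print(mean_partial_kinetic_energy(a=2.0, b=0.5, c=1.0, theta=1.5))  # close to theta/2
```

The common factors $e^{\psi/\Theta}$ and the area elements cancel in the
quotient, so they are omitted from the sums.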

The mechanical significance of the several parts into which
the kinetic energy is divided in equation (128) will be
apparent if we imagine that by the application of suitable forces
(different from those derived from $\epsilon_q$ and so much greater
that the latter may be neglected in comparison) the system
was brought from rest to the state of motion considered, so
rapidly that the configuration was not sensibly altered during
the process, and in such a manner also that the ratios of the
component velocities were constant in the process. If we
write

$$F_1\, dq_1 \ldots + F_n\, dq_n$$

for the moment of these forces, we have for the period of their
action by equation (3)

$$F_1 = \dot{p}_1 + \frac{d\epsilon_p}{dq_1} + \frac{d\epsilon_q}{dq_1} = \dot{p}_1 + \frac{d\epsilon}{dq_1}.$$

The work done by the force $F_1$ may be evaluated as follows:

$$\int F_1\, dq_1 = \int \dot{p}_1\, dq_1 + \int \frac{d\epsilon}{dq_1}\, dq_1,$$

where the last term may be cancelled because the configuration
does not vary sensibly during the application of the forces.
(It will be observed that the other terms contain factors which
increase as the time of the action of the forces is diminished.)
We have therefore,

$$\int F_1\, dq_1 = \int \dot{p}_1\, \dot{q}_1\, dt = \int \dot{q}_1\, dp_1 = \frac{\dot{q}_1}{p_1} \int p_1\, dp_1. \tag{131}$$

For since the $p$'s are linear functions of the $\dot{q}$'s (with
coefficients involving the $q$'s) the supposed constancy of the $q$'s and
of the ratios of the $\dot{q}$'s will make the ratio $\dot{q}_1/p_1$ constant.
The last integral is evidently to be taken between the limits
zero and the value of $p_1$ in the phase originally considered,
and the quantities before the integral sign may be taken as
relating to that phase. We have therefore

$$\int F_1\, dq_1 = \tfrac{1}{2}\, p_1\, \dot{q}_1 = \tfrac{1}{2}\, p_1 \frac{d\epsilon}{dp_1}. \tag{132}$$

That is: the several parts into which the kinetic energy is
divided in equation (128) represent the amounts of energy
communicated to the system by the several forces $F_1, \ldots F_n$
under the conditions mentioned.

The following transformation will not only give the value
of the average kinetic energy, but will also serve to separate
the distribution of the ensemble in configuration from its
distribution in velocity.

Since $2\epsilon_p$ is a homogeneous quadratic function of the $p$'s,
which is incapable of a negative value, it can always be
expressed (and in more than one way) as a sum of squares of
linear functions of the $p$'s.* The coefficients in these linear
functions, like those in the quadratic function, must be regarded
in the general case as functions of the $q$'s. Let

$$2\epsilon_p = u_1^2 + u_2^2 \ldots + u_n^2 \tag{133}$$

where $u_1, \ldots u_n$ are such linear functions of the $p$'s. If we
write

$$\frac{d(p_1 \ldots p_n)}{d(u_1 \ldots u_n)}$$

for the Jacobian or determinant of the differential coefficients
of the form $dp/du$, we may substitute

$$\frac{d(p_1 \ldots p_n)}{d(u_1 \ldots u_n)}\, du_1 \ldots du_n \quad \text{for} \quad dp_1 \ldots dp_n$$

under the multiple integral sign in any of our formulae. It
will be observed that this determinant is function of the $q$'s
alone. The sign of such a determinant depends on the
relative order of the variables in the numerator and denominator.
But since the suffixes of the $u$'s are only used to distinguish
these functions from one another, and no especial relation is
supposed between a $p$ and a $u$ which have the same suffix, we
may evidently, without loss of generality, suppose the suffixes
so applied that the determinant is positive.

* The reduction requires only the repeated application of the process of
'completing the square' used in the solution of quadratic equations.

Since the $u$'s are linear functions of the $p$'s, when the
integrations are to cover all values of the $p$'s (for constant $q$'s)
once and only once, they must cover all values of the $u$'s once
and only once, and the limits will be $\pm\infty$ for all the $u$'s.
Without the supposition of the last paragraph the upper limits
would not always be $+\infty$, as is evident on considering the
effect of changing the sign of a $u$. But with the supposition
which we have made (that the determinant is always positive)
we may make the upper limits $+\infty$ and the lower $-\infty$ for all
the $u$'s. Analogous considerations will apply where the
integrations do not cover all values of the $p$'s and therefore of
the $u$'s. The integrals may always be taken from a less to a
greater value of a $u$.
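The reduction to a sum of squares can be carried out explicitly for
two degrees of freedom. The Python sketch below is an illustration
only, under assumptions made here: $2\epsilon_p = a p_1^2 + 2 b p_1 p_2 + c p_2^2$ with
constant coefficients for which the form cannot be negative, and the
particular decomposition obtained by completing the square once. The
function names and numerical values are hypothetical.

```python
import math

def complete_the_square(a, b, c):
    """Rows of T such that u = T p gives u1^2 + u2^2 = a p1^2 + 2 b p1 p2 + c p2^2."""
    # Assumed: a > 0 and a*c - b*b > 0 (the form is incapable of a negative value).
    r = math.sqrt(a)
    s = math.sqrt(c - b * b / a)
    # u1 = r*p1 + (b/r)*p2 absorbs every term containing p1; u2 = s*p2 is the remainder.
    return [[r, b / r], [0.0, s]]

def u_of_p(T, p1, p2):
    """The linear functions u1, u2 of the momenta."""
    return (T[0][0] * p1 + T[0][1] * p2, T[1][0] * p1 + T[1][1] * p2)

def jacobian_dp_du(T):
    """d(p1, p2)/d(u1, u2), the reciprocal of the determinant of T."""
    return 1.0 / (T[0][0] * T[1][1] - T[0][1] * T[1][0])

a, b, c = 2.0, 0.5, 1.0
T = complete_the_square(a, b, c)
u1, u2 = u_of_p(T, 0.3, -1.1)
print(u1 * u1 + u2 * u2, a * 0.3**2 + 2 * b * 0.3 * (-1.1) + c * 1.1**2)
print(jacobian_dp_du(T), 1.0 / math.sqrt(a * c - b * b))
```

In this example the Jacobian is positive and equal to the reciprocal
square root of the determinant $ac - b^2$ of the quadratic form, so it
depends on the coefficients of the form alone. Rotating $u_1$, $u_2$
through any angle gives a different decomposition with the same sum
of squares and the same Jacobian.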

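The general integral which expresses the fractional part of
the ensemble which falls within any given limits of phase is
thus reduced to the form

$$\int\!\cdots\!\int e^{\frac{\psi - \epsilon}{\Theta}}\, \frac{d(p_1 \ldots p_n)}{d(u_1 \ldots u_n)}\, du_1 \ldots du_n\, dq_1 \ldots dq_n. \tag{134}$$

For the average value of the part of the kinetic energy
which is represented by $\tfrac{1}{2} u_1^2$, whether the average is taken
for the whole ensemble, or for a given configuration, we have
therefore

$$\overline{\tfrac{1}{2} u_1^2} = \frac{\displaystyle \int_{-\infty}^{+\infty} \tfrac{1}{2} u_1^2\, e^{-\frac{u_1^2}{2\Theta}}\, du_1}{\displaystyle \int_{-\infty}^{+\infty} e^{-\frac{u_1^2}{2\Theta}}\, du_1} = \frac{\left( \tfrac{1}{2} \pi \Theta^3 \right)^{1/2}}{(2\pi\Theta)^{1/2}} = \frac{\Theta}{2}, \tag{135}$$

and for the average of the whole kinetic energy, $\tfrac{1}{2} n \Theta$, as
before.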

The fractional part of the ensemble which lies within any
given limits of configuration, is found by integrating (134)
with respect to the $u$'s from $-\infty$ to $+\infty$. This gives

$$(2\pi\Theta)^{\frac{n}{2}} \int\!\cdots\!\int e^{\frac{\psi - \epsilon_q}{\Theta}}\, \frac{d(p_1 \ldots p_n)}{d(u_1 \ldots u_n)}\, dq_1 \ldots dq_n \tag{136}$$

which shows that the value of the Jacobian is independent of
the manner in which $2\epsilon_p$ is divided into a sum of squares.
We may verify this directly, and at the same time obtain a
more convenient expression for the Jacobian, as follows.

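It will be observed that since the $u$'s are linear functions of
the $p$'s, and the $p$'s linear functions of the $\dot{q}$'s, the $u$'s will be
linear functions of the $\dot{q}$'s, so that a differential coefficient of
the form $du/d\dot{q}$ will be independent of the $\dot{q}$'s, and function of
the $q$'s alone. Let us write $dp_x/du_y$ for the general element
of the Jacobian determinant. We have

$$\frac{dp_x}{du_y} = \frac{d}{du_y} \frac{d\epsilon}{d\dot{q}_x} = \frac{d}{du_y} \sum_{r=1}^{r=n} \frac{d\epsilon}{du_r} \frac{du_r}{d\dot{q}_x} = \sum_{r=1}^{r=n} \left( \frac{d^2\epsilon}{du_y\, du_r} \frac{du_r}{d\dot{q}_x} \right) = \frac{du_y}{d\dot{q}_x}. \tag{137}$$

Therefore

$$\frac{d(p_1 \ldots p_n)}{d(u_1 \ldots u_n)} = \frac{d(u_1 \ldots u_n)}{d(\dot{q}_1 \ldots \dot{q}_n)} \tag{138}$$

and

$$\left( \frac{d(p_1 \ldots p_n)}{d(u_1 \ldots u_n)} \right)^2 = \left( \frac{d(u_1 \ldots u_n)}{d(\dot{q}_1 \ldots \dot{q}_n)} \right)^2 = \frac{d(p_1 \ldots p_n)}{d(\dot{q}_1 \ldots \dot{q}_n)}. \tag{139}$$

These determinants are all functions of the $q$'s alone.* The
last is evidently the Hessian or determinant formed of the
second differential coefficients of the kinetic energy with
respect to $\dot{q}_1, \ldots \dot{q}_n$. We shall denote it by $\Delta_{\dot{q}}$. The reciprocal
determinant

$$\frac{d(\dot{q}_1 \ldots \dot{q}_n)}{d(p_1 \ldots p_n)},$$

which is the Hessian of the kinetic energy regarded as
function of the $p$'s, we shall denote by $\Delta_p$.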

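* It will be observed that the proof of (137) depends on the linear relation
between the $u$'s and $\dot{q}$'s, which makes $du_r/d\dot{q}_x$ constant with respect to the
differentiations here considered. Compare note on p. 12.

If we set

$$e^{-\frac{\psi_p}{\Theta}} = \int_{-\infty}^{+\infty}\!\cdots\!\int_{-\infty}^{+\infty} e^{-\frac{\epsilon_p}{\Theta}}\, \Delta_p^{\frac{1}{2}}\, dp_1 \ldots dp_n = \int_{-\infty}^{+\infty}\!\cdots\!\int_{-\infty}^{+\infty} e^{-\frac{u_1^2 \ldots + u_n^2}{2\Theta}}\, du_1 \ldots du_n = (2\pi\Theta)^{\frac{n}{2}}, \tag{140}$$

and

$$\psi_q = \psi - \psi_p, \tag{141}$$

the fractional part of the ensemble which lies within any
given limits of configuration (136) may be written

$$\int\!\cdots\!\int e^{\frac{\psi_q - \epsilon_q}{\Theta}}\, \Delta_{\dot{q}}^{\frac{1}{2}}\, dq_1 \ldots dq_n, \tag{142}$$

where the constant $\psi_q$ may be determined by the condition
that the integral extended over all configurations has the value
unity.*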


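* In the simple but important case in which $\Delta_{\dot{q}}$ is independent of the $q$'s,
and $\epsilon_q$ a quadratic function of the $q$'s, if we write $\epsilon_a$ for the least value of $\epsilon_q$
(or of $\epsilon$) consistent with the given values of the external coordinates, the
equation determining $\psi_q$ may be written

$$e^{\frac{\epsilon_a - \psi_q}{\Theta}} = \Delta_{\dot{q}}^{\frac{1}{2}} \int_{-\infty}^{+\infty}\!\cdots\!\int_{-\infty}^{+\infty} e^{-\frac{\epsilon_q - \epsilon_a}{\Theta}}\, dq_1 \ldots dq_n.$$

If we denote by $q_1', \ldots q_n'$ the values of $q_1, \ldots q_n$ which give $\epsilon_q$ its least value
$\epsilon_a$, it is evident that $\epsilon_q - \epsilon_a$ is a homogeneous quadratic function of the
differences $q_1 - q_1'$, etc., and that $dq_1, \ldots dq_n$ may be regarded as the differentials
of these differences. The evaluation of this integral is therefore analytically
similar to that of the integral

$$\int_{-\infty}^{+\infty}\!\cdots\!\int_{-\infty}^{+\infty} e^{-\frac{\epsilon_p}{\Theta}}\, dp_1 \ldots dp_n,$$

for which we have found the value $\Delta_p^{-\frac{1}{2}} (2\pi\Theta)^{\frac{n}{2}}$. By the same method, or
by analogy, we get

$$e^{\frac{\epsilon_a - \psi_q}{\Theta}} = \left( \frac{\Delta_{\dot{q}}}{\Delta_q} \right)^{\frac{1}{2}} (2\pi\Theta)^{\frac{n}{2}},$$

where $\Delta_q$ is the Hessian of the potential energy as function of the $q$'s. It
will be observed that $\Delta_q$ depends on the forces of the system and is
independent of the masses, while $\Delta_{\dot{q}}$ or its reciprocal $\Delta_p$ depends on the masses and
is independent of the forces. While each Hessian depends on the system of
coordinates employed, the ratio $\Delta_q / \Delta_{\dot{q}}$ is the same for all systems.

Multiplying the last equation by (140), we have

$$e^{\frac{\epsilon_a - \psi}{\Theta}} = \left( \frac{\Delta_{\dot{q}}}{\Delta_q} \right)^{\frac{1}{2}} (2\pi\Theta)^{n}.$$

For the average value of the potential energy, we have

$$\bar\epsilon_q - \epsilon_a = \frac{\displaystyle \int_{-\infty}^{+\infty}\!\cdots\!\int_{-\infty}^{+\infty} (\epsilon_q - \epsilon_a)\, e^{-\frac{\epsilon_q - \epsilon_a}{\Theta}}\, dq_1 \ldots dq_n}{\displaystyle \int_{-\infty}^{+\infty}\!\cdots\!\int_{-\infty}^{+\infty} e^{-\frac{\epsilon_q - \epsilon_a}{\Theta}}\, dq_1 \ldots dq_n}.$$

The evaluation of this expression is similar to that of

$$\frac{\displaystyle \int_{-\infty}^{+\infty}\!\cdots\!\int_{-\infty}^{+\infty} \epsilon_p\, e^{-\frac{\epsilon_p}{\Theta}}\, dp_1 \ldots dp_n}{\displaystyle \int_{-\infty}^{+\infty}\!\cdots\!\int_{-\infty}^{+\infty} e^{-\frac{\epsilon_p}{\Theta}}\, dp_1 \ldots dp_n},$$

which expresses the average value of the kinetic energy, and for which we
have found the value $\tfrac{1}{2} n \Theta$. We have accordingly

$$\bar\epsilon_q - \epsilon_a = \tfrac{1}{2}\, n\, \Theta.$$

Adding the equation

$$\bar\epsilon_p = \tfrac{1}{2}\, n\, \Theta,$$

we have

$$\bar\epsilon - \epsilon_a = n\, \Theta.$$

When an ensemble of systems is distributed in
configuration in the manner indicated in this formula, i.e., when its
distribution in configuration is the same as that of an
ensemble canonically distributed in phase, we shall say, without
any reference to its velocities, that it is canonically distributed
in configuration.

For any given configuration, the fractional part of the
systems which lie within any given limits of velocity is
represented by the quotient of the multiple integral

$$\int\!\cdots\!\int e^{\frac{\psi - \epsilon}{\Theta}}\, dp_1 \ldots dp_n,$$

or its equivalent

$$\int\!\cdots\!\int e^{\frac{\psi - \epsilon_q}{\Theta}}\, e^{-\frac{u_1^2 \ldots + u_n^2}{2\Theta}}\, \Delta_{\dot{q}}^{\frac{1}{2}}\, du_1 \ldots du_n,$$

taken within those limits divided by the value of the same
integral for the limits $\pm\infty$. But the value of the second
multiple integral for the limits $\pm\infty$ is evidently

$$e^{\frac{\psi - \epsilon_q}{\Theta}}\, \Delta_{\dot{q}}^{\frac{1}{2}}\, (2\pi\Theta)^{\frac{n}{2}}.$$

We may therefore write

$$\int\!\cdots\!\int e^{\frac{\psi_p - \epsilon_p}{\Theta}}\, du_1 \ldots du_n, \tag{143}$$

or

$$\int\!\cdots\!\int e^{\frac{\psi_p - \epsilon_p}{\Theta}}\, \Delta_p^{\frac{1}{2}}\, dp_1 \ldots dp_n, \tag{144}$$

or again

$$\int\!\cdots\!\int e^{\frac{\psi_p - \epsilon_p}{\Theta}}\, \Delta_{\dot{q}}^{\frac{1}{2}}\, d\dot{q}_1 \ldots d\dot{q}_n, \tag{145}$$

for the fractional part of the systems of any given
configuration which lie within given limits of velocity.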

When systems are distributed in velocity according to these
formulae, i.e., when the distribution in velocity is like that in
an ensemble which is canonically distributed in phase, we
shall say that they are canonically distributed in velocity.

The fractional part of the whole ensemble which falls
within any given limits of phase, which we have before
expressed in the form

$$\int\!\cdots\!\int e^{\frac{\psi - \epsilon}{\Theta}}\, dp_1 \ldots dp_n\, dq_1 \ldots dq_n, \tag{146}$$

may also be expressed in the form

$$\int\!\cdots\!\int e^{\frac{\psi - \epsilon}{\Theta}}\, \Delta_{\dot{q}}\, d\dot{q}_1 \ldots d\dot{q}_n\, dq_1 \ldots dq_n. \tag{147}$$



CHAPTER VI. 


EXTENSION IN CONFIGURATION AND EXTENSION 
IN VELOCITY. 

The formulae relating to canonical ensembles in the closing
paragraphs of the last chapter suggest certain general notions
and principles, which we shall consider in this chapter, and
which are not at all limited in their application to the
canonical law of distribution.*

* These notions and principles are in fact such as a more logical
arrangement of the subject would place in connection with those of Chapter I., to
which they are closely related. The strict requirements of logical order
have been sacrificed to the natural development of the subject, and very
elementary notions have been left until they have presented themselves in
the study of the leading problems.

We have seen in Chapter IV. that the nature of the
distribution which we have called canonical is independent of the
system of coordinates by which it is described, being
determined entirely by the modulus. It follows that the value
represented by the multiple integral (142), which is the
fractional part of the ensemble which lies within certain limiting
configurations, is independent of the system of coordinates,
being determined entirely by the limiting configurations with
the modulus. Now $\psi$, as we have already seen, represents
a value which is independent of the system of coordinates
by which it is defined. The same is evidently true of
$\psi_p$ by equation (140), and therefore, by (141), of $\psi_q$.
Hence the exponential factor in the multiple integral (142)
represents a value which is independent of the system of
coordinates. It follows that the value of a multiple integral
of the form

$$\int\!\cdots\!\int \Delta_{\dot{q}}^{\frac{1}{2}}\, dq_1 \ldots dq_n \tag{148}$$

is independent of the system of coordinates which is employed
for its evaluation, as will appear at once, if we suppose the
multiple integral to be broken up into parts so small that
the exponential factor may be regarded as constant in each.

In the same way the formulae (144) and (145) which express
the probability that a system (in a canonical ensemble) of given
configuration will fall within certain limits of velocity, show
that multiple integrals of the form

$$\int\cdots\int\Delta_p^{\frac12}\,dp_1\ldots dp_n \tag{149}$$

or

$$\int\cdots\int\Delta_{\dot q}^{\frac12}\,d\dot q_1\ldots d\dot q_n \tag{150}$$

relating to velocities possible for a given configuration, when
the limits are formed by given velocities, have values independent
of the system of coordinates employed.

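These relations may easily be verified directly. It has already
been proved that

$$\frac{d(P_1,\ldots P_n)}{d(p_1,\ldots p_n)}=\frac{d(\dot q_1,\ldots\dot q_n)}{d(\dot Q_1,\ldots\dot Q_n)}=\frac{d(q_1,\ldots q_n)}{d(Q_1,\ldots Q_n)},$$

where $q_1,\ldots q_n$, $p_1,\ldots p_n$ and $Q_1,\ldots Q_n$, $P_1,\ldots P_n$ are two
systems of coordinates and momenta.* It follows that

$$\int\cdots\int\left(\frac{d(p_1,\ldots p_n)}{d(\dot q_1,\ldots\dot q_n)}\right)^{\frac12}dq_1\ldots dq_n$$

$$=\int\cdots\int\left(\frac{d(p_1,\ldots p_n)}{d(\dot q_1,\ldots\dot q_n)}\right)^{\frac12}\frac{d(q_1,\ldots q_n)}{d(Q_1,\ldots Q_n)}\,dQ_1\ldots dQ_n$$

$$=\int\cdots\int\left(\frac{d(p_1,\ldots p_n)}{d(\dot q_1,\ldots\dot q_n)}\right)^{\frac12}\left(\frac{d(\dot q_1,\ldots\dot q_n)}{d(\dot Q_1,\ldots\dot Q_n)}\right)^{\frac12}\left(\frac{d(P_1,\ldots P_n)}{d(p_1,\ldots p_n)}\right)^{\frac12}dQ_1\ldots dQ_n$$

$$=\int\cdots\int\left(\frac{d(P_1,\ldots P_n)}{d(\dot Q_1,\ldots\dot Q_n)}\right)^{\frac12}dQ_1\ldots dQ_n,$$

and

$$\int\cdots\int\left(\frac{d(\dot q_1,\ldots\dot q_n)}{d(p_1,\ldots p_n)}\right)^{\frac12}dp_1\ldots dp_n$$

$$=\int\cdots\int\left(\frac{d(\dot q_1,\ldots\dot q_n)}{d(p_1,\ldots p_n)}\right)^{\frac12}\left(\frac{d(p_1,\ldots p_n)}{d(P_1,\ldots P_n)}\right)^{\frac12}\left(\frac{d(\dot Q_1,\ldots\dot Q_n)}{d(\dot q_1,\ldots\dot q_n)}\right)^{\frac12}dP_1\ldots dP_n$$

$$=\int\cdots\int\left(\frac{d(\dot Q_1,\ldots\dot Q_n)}{d(P_1,\ldots P_n)}\right)^{\frac12}dP_1\ldots dP_n.$$

* See equation (29).
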
The multiple integral

$$\int\cdots\int dp_1\ldots dp_n\,dq_1\ldots dq_n, \tag{151}$$

which may also be written

$$\int\cdots\int\Delta_{\dot q}\,d\dot q_1\ldots d\dot q_n\,dq_1\ldots dq_n, \tag{152}$$

and which, when taken within any given limits of phase, has
been shown to have a value independent of the coordinates
employed, expresses what we have called an extension-in-phase.*
In like manner we may say that the multiple integral
(148) expresses an extension-in-configuration, and that the
multiple integrals (149) and (150) express an extension-in-velocity.
We have called

$$dp_1\ldots dp_n\,dq_1\ldots dq_n, \tag{153}$$

which is equivalent to

$$\Delta_{\dot q}\,d\dot q_1\ldots d\dot q_n\,dq_1\ldots dq_n, \tag{154}$$

an element of extension-in-phase. We may call

$$\Delta_{\dot q}^{\frac12}\,dq_1\ldots dq_n \tag{155}$$

an element of extension-in-configuration, and

$$\Delta_p^{\frac12}\,dp_1\ldots dp_n, \tag{156}$$

or its equivalent

$$\Delta_{\dot q}^{\frac12}\,d\dot q_1\ldots d\dot q_n, \tag{157}$$

an element of extension-in-velocity.

* See Chapter I, p. 10.

An extension-in-phase may always be regarded as an integral
of elementary extensions-in-configuration multiplied each by
an extension-in-velocity. This is evident from the formulae
(151) and (152) which express an extension-in-phase, if we
imagine the integrations relative to velocity to be first carried
out.

The product of the two expressions for an element of
extension-in-velocity (149) and (150) is evidently of the same
dimensions as the product

$$p_1\ldots p_n\,\dot q_1\ldots\dot q_n,$$

that is, as the $n$th power of energy, since every product of the
form $p_1\dot q_1$ has the dimensions of energy. Therefore an
extension-in-velocity has the dimensions of the square root of the
$n$th power of energy. Again we see by (155) and (156) that
the product of an extension-in-configuration and an
extension-in-velocity has the dimensions of the $n$th power of energy
multiplied by the $n$th power of time. Therefore an
extension-in-configuration has the dimensions of the $n$th power of time
multiplied by the square root of the $n$th power of energy.

To the notion of extension-in-configuration there attach
themselves certain other notions analogous to those which have
presented themselves in connection with the notion of
extension-in-phase. The number of systems of any ensemble
(whether distributed canonically or in any other manner)
which are contained in an element of extension-in-configuration,
divided by the numerical value of that element, may be
called the density-in-configuration. That is, if a certain
configuration is specified by the coordinates $q_1,\ldots q_n$, and the
number of systems of which the coordinates fall between the
limits $q_1$ and $q_1+dq_1,\ldots q_n$ and $q_n+dq_n$ is expressed by

$$D_q\,\Delta_{\dot q}^{\frac12}\,dq_1\ldots dq_n, \tag{158}$$

$D_q$ will be the density-in-configuration. And if we set

$$e^{\eta_q}=\frac{D_q}{N}, \tag{159}$$

where $N$ denotes, as usual, the total number of systems in the
ensemble, the probability that an unspecified system of the
ensemble will fall within the given limits of configuration, is
expressed by

$$e^{\eta_q}\,\Delta_{\dot q}^{\frac12}\,dq_1\ldots dq_n. \tag{160}$$

We may call $e^{\eta_q}$ the coefficient of probability of the
configuration, and $\eta_q$ the index of probability of the configuration.

The fractional part of the whole number of systems which
are within any given limits of configuration will be expressed
by the multiple integral

$$\int\cdots\int e^{\eta_q}\,\Delta_{\dot q}^{\frac12}\,dq_1\ldots dq_n. \tag{161}$$

The value of this integral (taken within any given configurations)
is therefore independent of the system of coordinates
which is used. Since the same has been proved of the same
integral without the factor $e^{\eta_q}$, it follows that the values of
$\eta_q$ and $D_q$ for a given configuration in a given ensemble are
independent of the system of coordinates which is used.

The notion of extension-in-velocity relates to systems having
the same configuration.* If an ensemble is distributed
both in configuration and in velocity, we may confine our
attention to those systems which are contained within certain
infinitesimal limits of configuration, and compare the whole
number of such systems with those which are also contained
within certain infinitesimal limits of velocity. The second
of these numbers divided by the first expresses the probability
that a system which is only specified as falling within the
infinitesimal limits of configuration shall also fall within the
infinitesimal limits of velocity. If the limits with respect to
velocity are expressed by the condition that the momenta
shall fall between the limits $p_1$ and $p_1+dp_1,\ldots p_n$ and
$p_n+dp_n$, the extension-in-velocity within those limits will be

$$\Delta_p^{\frac12}\,dp_1\ldots dp_n,$$

and we may express the probability in question by

$$e^{\eta_p}\,\Delta_p^{\frac12}\,dp_1\ldots dp_n. \tag{162}$$

This may be regarded as defining $\eta_p$.

* Except in some simple cases, such as a system of material points, we
cannot compare velocities in one configuration with velocities in another, and
speak of their identity or difference except in a sense entirely artificial. We
may indeed say that we call the velocities in one configuration the same as
those in another when the quantities $\dot q_1,\ldots\dot q_n$ have the same values in the
two cases. But this signifies nothing until the system of coordinates has
been defined. We might identify the velocities in the two cases which make
the quantities $p_1,\ldots p_n$ the same in each. This again would signify nothing
independently of the system of coordinates employed.

The probability that a system which is only specified as
having a configuration within certain infinitesimal limits shall
also fall within any given limits of velocity will be expressed
by the multiple integral

$$\int\cdots\int e^{\eta_p}\,\Delta_p^{\frac12}\,dp_1\ldots dp_n, \tag{163}$$

or its equivalent

$$\int\cdots\int e^{\eta_p}\,\Delta_{\dot q}^{\frac12}\,d\dot q_1\ldots d\dot q_n, \tag{164}$$

taken within the given limits.

It follows that the probability that the system will fall
within the limits of velocity, $\dot q_1$ and $\dot q_1+d\dot q_1,\ldots\dot q_n$ and
$\dot q_n+d\dot q_n$, is expressed by

$$e^{\eta_p}\,\Delta_{\dot q}^{\frac12}\,d\dot q_1\ldots d\dot q_n. \tag{165}$$

The value of the integrals (163), (164) is independent of
the system of coordinates and momenta which is used, as is
also the value of the same integrals without the factor $e^{\eta_p}$;
therefore the value of $\eta_p$ must be independent of the
system of coordinates and momenta. We may call $e^{\eta_p}$ the
coefficient of probability of velocity, and $\eta_p$ the index of
probability of velocity.

Comparing (160) and (162) with (40), we get

$$e^{\eta_q}e^{\eta_p}=P=e^{\eta} \tag{166}$$

or

$$\eta_q+\eta_p=\eta. \tag{167}$$

That is: the product of the coefficients of probability of
configuration and of velocity is equal to the coefficient of
probability of phase; the sum of the indices of probability of
configuration and of velocity is equal to the index of
probability of phase.

It is evident that $e^{\eta_q}$ and $e^{\eta_p}$ have the dimensions of the
reciprocals of extension-in-configuration and extension-in-velocity
respectively, i.e., the dimensions of $t^{-n}\epsilon^{-\frac n2}$ and $\epsilon^{-\frac n2}$,
where $t$ represents any time, and $\epsilon$ any energy. If, therefore,
the unit of time is multiplied by $c_t$, and the unit of energy by
$c_\epsilon$, every $\eta_q$ will be increased by the addition of

$$n\log c_t+\tfrac12 n\log c_\epsilon, \tag{168}$$

and every $\eta_p$ by the addition of

$$\tfrac12 n\log c_\epsilon.* \tag{169}$$

It should be observed that the quantities which have been
called extension-in-configuration and extension-in-velocity are
not, as the terms might seem to imply, purely geometrical or
kinematical conceptions. To express their nature more fully,
they might appropriately have been called, respectively, the
dynamical measure of the extension in configuration, and the
dynamical measure of the extension in velocity. They depend
upon the masses, although not upon the forces of the
system. In the simple case of material points, where each
point is limited to a given space, the extension-in-configuration
is the product of the volumes within which the several points
are confined (these may be the same or different), multiplied
by the square root of the cube of the product of the masses of
the several points. The extension-in-velocity for such systems
is most easily defined as the extension-in-configuration of
systems which have moved from the same configuration for
the unit of time with the given velocities.
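
The rule just stated for material points can be checked against the general element $\Delta_{\dot q}^{1/2}\,dq_1\ldots dq_n$. The following is only a sketch under stated assumptions: rectangular coordinates are assumed, so that the Hessian of the kinetic energy is diagonal with each mass repeated three times, and the function names are illustrative and not taken from the text.

```python
import math


def extension_in_configuration(volumes, masses):
    """Closed form quoted in the text: the product of the volumes times the
    square root of the cube of the product of the masses."""
    if len(volumes) != len(masses):
        raise ValueError("one volume is required for each material point")
    return math.prod(volumes) * math.prod(masses) ** 1.5


def extension_from_hessian(volumes, masses):
    """Same quantity from sqrt(Delta_qdot) times the integral of dq_1...dq_n.

    Assumption: rectangular coordinates, so the Hessian of the kinetic
    energy is diagonal and contains every mass three times.
    """
    diagonal = [m for m in masses for _ in range(3)]
    delta_qdot = math.prod(diagonal)  # determinant of a diagonal matrix
    return math.sqrt(delta_qdot) * math.prod(volumes)
```

For two points with volumes 2 and 3 and masses 1 and 4, both functions give 48, since $6\times(1\cdot 4)^{3/2}=48$.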

* Compare (47) in Chapter I.

In the general case, the notions of extension-in-configuration
and extension-in-velocity may be connected as follows.

If an ensemble of similar systems of $n$ degrees of freedom
have the same configuration at a given instant, but are distributed
throughout any finite extension-in-velocity, the same
ensemble after an infinitesimal interval of time $\delta t$ will be
distributed throughout an extension in configuration equal to
its original extension-in-velocity multiplied by $\delta t^n$.

In demonstrating this theorem, we shall write $q_1',\ldots q_n'$ for
the initial values of the coordinates. The final values will
evidently be connected with the initial by the equations

$$q_1-q_1'=\dot q_1\,\delta t,\ \ldots\ q_n-q_n'=\dot q_n\,\delta t. \tag{170}$$

Now the original extension-in-velocity is by definition
represented by the integral

$$\int\cdots\int\Delta_{\dot q}^{\frac12}\,d\dot q_1\ldots d\dot q_n, \tag{171}$$

where the limits may be expressed by an equation of the form

$$F(\dot q_1,\ldots\dot q_n)=0. \tag{172}$$

The same integral multiplied by the constant $\delta t^n$ may be
written

$$\int\cdots\int\Delta_{\dot q}^{\frac12}\,d(\dot q_1\delta t)\ldots d(\dot q_n\delta t), \tag{173}$$

and the limits may be written

$$F(\dot q_1,\ldots\dot q_n)=f(\dot q_1\delta t,\ldots\dot q_n\delta t)=0. \tag{174}$$

(It will be observed that $\delta t$ as well as $\Delta_{\dot q}$ is constant in the
integrations.) Now this integral is identically equal to

$$\int\cdots\int\Delta_{\dot q}^{\frac12}\,d(q_1-q_1')\ldots d(q_n-q_n'), \tag{175}$$

or its equivalent

$$\int\cdots\int\Delta_{\dot q}^{\frac12}\,dq_1\ldots dq_n, \tag{176}$$

with limits expressed by the equation

$$f(q_1-q_1',\ldots q_n-q_n')=0. \tag{177}$$

But the systems which initially had velocities satisfying the
equation (172) will after the interval $\delta t$ have configurations
satisfying equation (177). Therefore the extension-in-configuration
represented by the last integral is that which
belongs to the systems which originally had the
extension-in-velocity represented by the integral (171).
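
A small numerical illustration of this theorem may be helpful. It is a sketch only, under assumptions that are not part of the text: two degrees of freedom, a constant value of $\Delta_{\dot q}^{1/2}$ (passed in as the hypothetical parameter `sqrt_delta`), and a polygonal region of velocities whose area is found by the shoelace formula.

```python
import math


def polygon_area(vertices):
    """Area of a simple polygon, given as a list of (x, y) pairs (shoelace formula)."""
    total = 0.0
    for (x1, y1), (x2, y2) in zip(vertices, vertices[1:] + vertices[:1]):
        total += x1 * y2 - x2 * y1
    return abs(total) / 2.0


def extension(vertices, sqrt_delta):
    """sqrt(Delta_qdot) times the area enclosed by the limits.

    Assumption: Delta_qdot is constant over the region, as it is in the
    integrations of the proof above.
    """
    return sqrt_delta * polygon_area(vertices)


def displaced_configurations(initial, velocity_vertices, dt):
    """Configurations q = q' + qdot * dt reached after the interval dt, as in (170)."""
    q1, q2 = initial
    return [(q1 + v1 * dt, q2 + v2 * dt) for v1, v2 in velocity_vertices]
```

With $n=2$ the extension-in-configuration of the displaced systems should equal the original extension-in-velocity multiplied by $\delta t^2$, whatever common initial configuration is chosen.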

Since the quantities which we have called extensions-in-phase,
extensions-in-configuration, and extensions-in-velocity
are independent of the nature of the system of coordinates
used in their definitions, it is natural to seek definitions which
shall be independent of the use of any coordinates. It will be
sufficient to give the following definitions without formal proof
of their equivalence with those given above, since they are
less convenient for use than those founded on systems of
coordinates, and since we shall in fact have no occasion to use
them.

We commence with the definition of extension-in-velocity.
We may imagine $n$ independent velocities, $V_1,\ldots V_n$, of which a
system in a given configuration is capable. We may conceive
of the system as having a certain velocity combined with a
part of each of these velocities $V_1,\ldots V_n$. By a part of $V_1$ is
meant a velocity of the same nature as $V_1$ but in amount being
anything between zero and $V_1$. Now all the velocities which
may be thus described may be regarded as forming or lying in
a certain extension of which we desire a measure. The case
is greatly simplified if we suppose that certain relations exist
between the velocities $V_1,\ldots V_n$, viz.: that the kinetic energy
due to any two of these velocities combined is the sum of the
kinetic energies due to the velocities separately. In this case
the extension-in-motion is the square root of the product of
the doubled kinetic energies due to the $n$ velocities $V_1,\ldots V_n$
taken separately.

The more general case may be reduced to this simpler case
as follows. The velocity $V_2$ may always be regarded as
composed of two velocities $V_2'$ and $V_2''$, of which $V_2'$ is of
the same nature as $V_1$, (it may be more or less in amount, or
opposite in sign,) while $V_2''$ satisfies the relation that the
kinetic energy due to $V_1$ and $V_2''$ combined is the sum of the
kinetic energies due to these velocities taken separately. And
the velocity $V_3$ may be regarded as compounded of three,
$V_3'$, $V_3''$, $V_3'''$, of which $V_3'$ is of the same nature as $V_1$, $V_3''$
of the same nature as $V_2''$, while $V_3'''$ satisfies the relations
that if combined either with $V_1$ or $V_2''$ the kinetic energy of
the combined velocities is the sum of the kinetic energies of
the velocities taken separately. When all the velocities
$V_2,\ldots V_n$ have been thus decomposed, the square root of the
product of the doubled kinetic energies of the several velocities
$V_1$, $V_2''$, $V_3'''$, etc., will be the value of the extension-in-velocity
which is sought.

This method of evaluation of the extension-in-velocity which
we are considering is perhaps the most simple and natural, but
the result may be expressed in a more symmetrical form. Let
us write $\epsilon_{12}$ for the kinetic energy of the velocities $V_1$ and $V_2$
combined, diminished by the sum of the kinetic energies due
to the same velocities taken separately. This may be called
the mutual energy of the velocities $V_1$ and $V_2$. Let the
mutual energy of every pair of the velocities $V_1,\ldots V_n$ be
expressed in the same way. Analogy would make $\epsilon_{11}$ represent
the energy of twice $V_1$ diminished by twice the energy of $V_1$,
i.e., $\epsilon_{11}$ would represent twice the energy of $V_1$, although the
term mutual energy is hardly appropriate to this case. At all
events, let $\epsilon_{11}$ have this signification, and $\epsilon_{22}$ represent twice
the energy of $V_2$, etc. The square root of the determinant

$$\begin{vmatrix}\epsilon_{11}&\epsilon_{12}&\ldots&\epsilon_{1n}\\ \epsilon_{21}&\epsilon_{22}&\ldots&\epsilon_{2n}\\ \vdots&\vdots&&\vdots\\ \epsilon_{n1}&\epsilon_{n2}&\ldots&\epsilon_{nn}\end{vmatrix}$$

represents the value of the extension-in-velocity determined as
above described by the velocities $V_1,\ldots V_n$.
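
The two evaluations described above, the successive decomposition of the velocities and the determinant of mutual energies, can be compared numerically. What follows is a minimal sketch under assumptions of our own: velocities are represented by their components $\dot q_i$ in some system of coordinates, the kinetic energy is $\frac12\sum m_{ij}\dot q_i\dot q_j$ with a symmetric positive-definite matrix `mass` (a hypothetical name), and the decomposition is carried out as an orthogonalization with respect to the mutual energy.

```python
import math


def mutual_energy(u, v, mass):
    """u^T M v. For u == v this is twice the kinetic energy of u."""
    n = len(u)
    return sum(u[i] * mass[i][j] * v[j] for i in range(n) for j in range(n))


def extension_by_decomposition(velocities, mass):
    """Square root of the product of the doubled kinetic energies of
    V1, V2'', V3''', ... obtained by stripping from each velocity its
    parts of the same nature as the earlier decomposed velocities."""
    independent = []
    for v in velocities:
        w = list(v)
        for u in independent:
            share = mutual_energy(v, u, mass) / mutual_energy(u, u, mass)
            w = [wi - share * ui for wi, ui in zip(w, u)]
        independent.append(w)
    return math.sqrt(math.prod(mutual_energy(u, u, mass) for u in independent))


def determinant(matrix):
    """Determinant by Gaussian elimination with partial pivoting."""
    a = [row[:] for row in matrix]
    n = len(a)
    det = 1.0
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(a[r][col]))
        if a[pivot][col] == 0.0:
            return 0.0
        if pivot != col:
            a[col], a[pivot] = a[pivot], a[col]
            det = -det
        det *= a[col][col]
        for r in range(col + 1, n):
            factor = a[r][col] / a[col][col]
            for c in range(col, n):
                a[r][c] -= factor * a[col][c]
    return det


def extension_by_determinant(velocities, mass):
    """Square root of the determinant whose elements are the mutual energies."""
    table = [[mutual_energy(u, v, mass) for v in velocities] for u in velocities]
    return math.sqrt(determinant(table))
```

Both functions should agree for any set of independent velocities. The decomposition used here is the familiar Gram-Schmidt process, which is our reading of the construction in the text rather than a procedure the text names.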

The statements of the preceding paragraph may be readily
proved from the expression (157) on page 60, viz.,

$$\Delta_{\dot q}^{\frac12}\,d\dot q_1\ldots d\dot q_n,$$

by which the notion of an element of extension-in-velocity was
originally defined. Since $\Delta_{\dot q}$ in this expression represents
the determinant of which the general element is

$$\frac{d^2\epsilon}{d\dot q_i\,d\dot q_j},$$

the square of the preceding expression represents the determinant
of which the general element is

$$\frac{d^2\epsilon}{d\dot q_i\,d\dot q_j}\,d\dot q_i\,d\dot q_j.$$

Now we may regard the differentials of velocity $d\dot q_i$, $d\dot q_j$ as
themselves infinitesimal velocities. Then the last expression
represents the mutual energy of these velocities, and

$$\frac{d^2\epsilon}{d\dot q_i^2}\,d\dot q_i^2$$

represents twice the energy due to the velocity $d\dot q_i$.

The case which we have considered is an extension-in-velocity
of the simplest form. All extensions-in-velocity do not
have this form, but all may be regarded as composed of
elementary extensions of this form, in the same manner as
all volumes may be regarded as composed of elementary
parallelepipeds.

Having thus a measure of extension-in-velocity founded, it
will be observed, on the dynamical notion of kinetic energy,
and not involving an explicit mention of coordinates, we may
derive from it a measure of extension-in-configuration by the
principle connecting these quantities which has been given in
a preceding paragraph of this chapter.

The measure of extension-in-phase may be obtained from
that of extension-in-configuration and of extension-in-velocity.
For to every configuration in an extension-in-phase there will
belong a certain extension-in-velocity, and the integral of the
elements of extension-in-configuration within any
extension-in-phase multiplied each by its extension-in-velocity is the
measure of the extension-in-phase.



CHAPTER VII.


FARTHER DISCUSSION OF AVERAGES IN A CANONICAL
ENSEMBLE OF SYSTEMS.


Returning to the case of a canonical distribution, we have
for the index of probability of configuration

$$\eta_q=\frac{\psi_q-\epsilon_q}{\Theta} \tag{178}$$

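as appears on comparison of formulae (142) and (161). It
follows immediately from (142) that the average value in the
ensemble of any quantity $u$ which depends on the configuration
alone is given by the formula

$$\bar u=\int^{\text{all}}_{\text{config.}}\cdots\int u\,e^{\frac{\psi_q-\epsilon_q}{\Theta}}\,\Delta_{\dot q}^{\frac12}\,dq_1\ldots dq_n, \tag{179}$$

where the integrations cover all possible configurations. The
value of $\psi_q$ is evidently determined by the equation

$$e^{-\frac{\psi_q}{\Theta}}=\int^{\text{all}}_{\text{config.}}\cdots\int e^{-\frac{\epsilon_q}{\Theta}}\,\Delta_{\dot q}^{\frac12}\,dq_1\ldots dq_n. \tag{180}$$
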

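By differentiating the last equation we may obtain results
analogous to those obtained in Chapter IV from the equation

$$e^{-\frac{\psi}{\Theta}}=\int^{\text{all}}_{\text{phases}}\cdots\int e^{-\frac{\epsilon}{\Theta}}\,dp_1\ldots dq_n.$$

As the process is identical, it is sufficient to give the results:

$$d\psi_q=\bar\eta_q\,d\Theta-\bar A_1\,da_1-\bar A_2\,da_2-\text{etc.}, \tag{181}$$

or, since

$$\psi_q=\bar\epsilon_q+\Theta\bar\eta_q, \tag{182}$$

and

$$d\psi_q=d\bar\epsilon_q+\Theta\,d\bar\eta_q+\bar\eta_q\,d\Theta, \tag{183}$$

$$d\bar\epsilon_q=-\Theta\,d\bar\eta_q-\bar A_1\,da_1-\bar A_2\,da_2-\text{etc.} \tag{184}$$
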

It appears from this equation that the differential relations
subsisting between the average potential energy in an ensemble
of systems canonically distributed, the modulus of distribution,
the average index of probability of configuration, taken
negatively, and the average forces exerted on external bodies,
are equivalent to those enunciated by Clausius for the potential
energy of a body, its temperature, a quantity which he called
the disgregation, and the forces exerted on external bodies.*
For the index of probability of velocity, in the case of
canonical distribution, we have by comparison of (144) and (163),
or of (145) and (164),

$$\eta_p=\frac{\psi_p-\epsilon_p}{\Theta}, \tag{185}$$

which gives

$$\bar\eta_p=\frac{\psi_p-\bar\epsilon_p}{\Theta}; \tag{186}$$

we have also

$$\bar\epsilon_p=\tfrac12 n\Theta, \tag{187}$$

and by (140),

$$\psi_p=-\tfrac12 n\Theta\log(2\pi\Theta). \tag{188}$$

From these equations we get by differentiation

$$d\psi_p=\bar\eta_p\,d\Theta, \tag{189}$$

and

$$d\bar\epsilon_p=-\Theta\,d\bar\eta_p. \tag{190}$$
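
The differential relations (189) and (190) may be checked numerically from the closed forms (186), (187) and (188). The sketch below makes no claim beyond that: the function names are illustrative, and the derivatives are approximated by central differences with a step chosen by us.

```python
import math


def psi_p(theta, n):
    """Equation (188): psi_p = -(n/2) * Theta * log(2 * pi * Theta)."""
    return -0.5 * n * theta * math.log(2.0 * math.pi * theta)


def mean_kinetic_energy(theta, n):
    """Equation (187): the average kinetic energy is (n/2) * Theta."""
    return 0.5 * n * theta


def mean_index_of_velocity(theta, n):
    """Equation (186): average index of probability of velocity."""
    return (psi_p(theta, n) - mean_kinetic_energy(theta, n)) / theta


def derivative(f, x, step=1e-6):
    """Central-difference approximation to df/dx (the step is an assumption)."""
    return (f(x + step) - f(x - step)) / (2.0 * step)


def check_relations(theta, n):
    """Return the two sides of (189) and of (190) as pairs for comparison."""
    lhs_189 = derivative(lambda t: psi_p(t, n), theta)
    rhs_189 = mean_index_of_velocity(theta, n)
    lhs_190 = derivative(lambda t: mean_kinetic_energy(t, n), theta)
    rhs_190 = -theta * derivative(lambda t: mean_index_of_velocity(t, n), theta)
    return (lhs_189, rhs_189), (lhs_190, rhs_190)
```

For any positive modulus and any number of degrees of freedom, the members of each returned pair should agree to within the error of the finite differences.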


The differential relation expressed in this equation between
the average kinetic energy, the modulus, and the average index
of probability of velocity, taken negatively, is identical with
that given by Clausius locis citatis for the kinetic energy of a
body, the temperature, and a quantity which he called the
transformation-value of the kinetic energy.† The relations

$$\bar\epsilon=\bar\epsilon_q+\bar\epsilon_p,\qquad\bar\eta=\bar\eta_q+\bar\eta_p$$

are also identical with those given by Clausius for the
corresponding quantities.

* Pogg. Ann., Bd. CXVI, S. 73, (1862); ibid., Bd. CXXV, S. 363, (1865).
See also Boltzmann, Sitzb. der Wiener Akad., Bd. LXIII, S. 728, (1871).
† Verwandlungswerth des Wärmeinhaltes.

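Equations (112) and (181) show that if $\psi$ or $\psi_q$ is known
as function of $\Theta$ and $a_1$, $a_2$, etc., we can obtain by differentiation
$\bar\epsilon$ or $\bar\epsilon_q$, and $\bar A_1$, $\bar A_2$, etc. as functions of the same
variables. We have in fact

$$\bar\epsilon=\psi-\Theta\bar\eta=\psi-\Theta\frac{d\psi}{d\Theta}, \tag{191}$$

$$\bar\epsilon_q=\psi_q-\Theta\bar\eta_q=\psi_q-\Theta\frac{d\psi_q}{d\Theta}. \tag{192}$$

The corresponding equation relating to kinetic energy,

$$\bar\epsilon_p=\psi_p-\Theta\bar\eta_p=\psi_p-\Theta\frac{d\psi_p}{d\Theta}, \tag{193}$$

which may be obtained in the same way, may be verified by
the known relations (186), (187), and (188) between the
variables. We have also

$$\bar A_1=-\frac{d\psi}{da_1}=-\frac{d\psi_q}{da_1}, \tag{194}$$

etc., so that the average values of the external forces may be
derived alike from $\psi$ or from $\psi_q$.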

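The average values of the squares or higher powers of the
energies (total, potential, or kinetic) may easily be obtained by
repeated differentiations of $\psi$, $\psi_q$, $\psi_p$, or $\bar\epsilon$, $\bar\epsilon_q$, $\bar\epsilon_p$, with
respect to $\Theta$. By equation (108) we have

$$\bar\epsilon=\int^{\text{all}}_{\text{phases}}\cdots\int\epsilon\,e^{\frac{\psi-\epsilon}{\Theta}}\,dp_1\ldots dq_n, \tag{195}$$

and differentiating with respect to $\Theta$,

$$\frac{d\bar\epsilon}{d\Theta}=\int^{\text{all}}_{\text{phases}}\cdots\int\left(\frac{\epsilon^2-\psi\epsilon}{\Theta^2}+\frac{\epsilon}{\Theta}\frac{d\psi}{d\Theta}\right)e^{\frac{\psi-\epsilon}{\Theta}}\,dp_1\ldots dq_n, \tag{196}$$

whence, again by (108),

$$\frac{d\bar\epsilon}{d\Theta}=\frac{\overline{\epsilon^2}-\psi\bar\epsilon}{\Theta^2}+\frac{\bar\epsilon}{\Theta}\frac{d\psi}{d\Theta}. \tag{197}$$

Combining this with (191),

$$\overline{\epsilon^2}=\bar\epsilon^2+\Theta^2\frac{d\bar\epsilon}{d\Theta}=\left(\psi-\Theta\frac{d\psi}{d\Theta}\right)^2-\Theta^3\frac{d^2\psi}{d\Theta^2}. \tag{198}$$
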
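In precisely the same way, from the equation

$$\bar\epsilon_q=\int^{\text{all}}_{\text{config.}}\cdots\int\epsilon_q\,e^{\frac{\psi_q-\epsilon_q}{\Theta}}\,\Delta_{\dot q}^{\frac12}\,dq_1\ldots dq_n \tag{199}$$

we may obtain

$$\overline{\epsilon_q^2}=\bar\epsilon_q^2+\Theta^2\frac{d\bar\epsilon_q}{d\Theta}=\left(\psi_q-\Theta\frac{d\psi_q}{d\Theta}\right)^2-\Theta^3\frac{d^2\psi_q}{d\Theta^2}. \tag{200}$$

In the same way also, if we confine ourselves to a particular
configuration, from the equation

$$\bar\epsilon_p=\int\cdots\int\epsilon_p\,e^{\frac{\psi_p-\epsilon_p}{\Theta}}\,\Delta_p^{\frac12}\,dp_1\ldots dp_n \tag{201}$$

we obtain

$$\overline{\epsilon_p^2}=\bar\epsilon_p^2+\Theta^2\frac{d\bar\epsilon_p}{d\Theta}=\left(\psi_p-\Theta\frac{d\psi_p}{d\Theta}\right)^2-\Theta^3\frac{d^2\psi_p}{d\Theta^2}, \tag{202}$$

which by (187) reduces to

$$\overline{\epsilon_p^2}=\left(\tfrac14 n^2+\tfrac12 n\right)\Theta^2. \tag{203}$$

Since this value is independent of the configuration, we see
that the average square of the kinetic energy for every
configuration is the same, and therefore the same as for the whole
ensemble. Hence $\overline{\epsilon_p^2}$ may be interpreted as the average either
for any particular configuration, or for the whole ensemble.
It will be observed that the value of this quantity is determined
entirely by the modulus and the number of degrees of
freedom of the system, and is in other respects independent of
the nature of the system.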
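
The value $\left(\tfrac14n^2+\tfrac12n\right)\Theta^2$ for the average square of the kinetic energy can be illustrated by random sampling. This is a rough sketch under an assumption that is not in the text: the kinetic energy is taken as $\frac12\sum p_i^2$ (unit masses), so that under the canonical law each momentum is normally distributed with variance $\Theta$. The sample size and seed are arbitrary choices.

```python
import random


def sample_kinetic_energy(n, theta, rng):
    """One canonical sample of (1/2) * sum(p_i^2) for n momenta.

    Assumption: unit masses, so each p_i is Gaussian with variance theta.
    """
    sigma = theta ** 0.5
    return sum(rng.gauss(0.0, sigma) ** 2 for _ in range(n)) / 2.0


def mean_square_kinetic(n, theta, samples=100_000, seed=0):
    """Monte Carlo estimate of the average square of the kinetic energy."""
    rng = random.Random(seed)
    total = 0.0
    for _ in range(samples):
        energy = sample_kinetic_energy(n, theta, rng)
        total += energy * energy
    return total / samples


def predicted_mean_square(n, theta):
    """The closed form (n^2/4 + n/2) * Theta^2 stated in the text."""
    return (0.25 * n * n + 0.5 * n) * theta ** 2
```

The estimate should approach the predicted value as the number of samples grows, and the prediction depends only on $n$ and $\Theta$, as the text observes.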

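Of especial importance are the anomalies of the energies, or
their deviations from their average values. The average value
of these anomalies is of course zero. The natural measure of
such anomalies is the square root of their average square. Now

$$\overline{(\epsilon-\bar\epsilon)^2}=\overline{\epsilon^2}-\bar\epsilon^2, \tag{204}$$

identically. Accordingly

$$\overline{(\epsilon-\bar\epsilon)^2}=\Theta^2\frac{d\bar\epsilon}{d\Theta}=-\Theta^3\frac{d^2\psi}{d\Theta^2}. \tag{205}$$

In like manner,

$$\overline{(\epsilon_q-\bar\epsilon_q)^2}=\Theta^2\frac{d\bar\epsilon_q}{d\Theta}=-\Theta^3\frac{d^2\psi_q}{d\Theta^2}, \tag{206}$$

$$\overline{(\epsilon_p-\bar\epsilon_p)^2}=\Theta^2\frac{d\bar\epsilon_p}{d\Theta}=-\Theta^3\frac{d^2\psi_p}{d\Theta^2}=\tfrac12 n\Theta^2. \tag{207}$$

Hence

$$\overline{(\epsilon-\bar\epsilon)^2}=\overline{(\epsilon_q-\bar\epsilon_q)^2}+\overline{(\epsilon_p-\bar\epsilon_p)^2}. \tag{208}$$

Equation (206) shows that the value of $d\bar\epsilon_q/d\Theta$ can never be
negative, and that the value of $d^2\psi_q/d\Theta^2$ or $d\bar\eta_q/d\Theta$ can never
be positive.*

* In the case discussed in the note on page 54, in which the potential
energy is a quadratic function of the $q$'s, and $\Delta_{\dot q}$ independent of the $q$'s, we
should get for the potential energy

$$\overline{(\epsilon_q-\bar\epsilon_q)^2}=\tfrac12 n\Theta^2,$$

and for the total energy

$$\overline{(\epsilon-\bar\epsilon)^2}=n\Theta^2.$$

We may also write in this case,

$$\frac{\overline{(\epsilon_q-\bar\epsilon_q)^2}}{(\bar\epsilon_q-\epsilon_a)^2}=\frac2n,\qquad\frac{\overline{(\epsilon-\bar\epsilon)^2}}{(\bar\epsilon-\epsilon_a)^2}=\frac1n.$$

To get an idea of the order of magnitude of these quantities,
we may use the average kinetic energy as a term of comparison,
this quantity being independent of the arbitrary constant
involved in the definition of the potential energy. Since

$$\bar\epsilon_p=\tfrac12 n\Theta,$$

$$\frac{\overline{(\epsilon_p-\bar\epsilon_p)^2}}{\bar\epsilon_p^2}=\frac2n, \tag{209}$$

$$\frac{\overline{(\epsilon_q-\bar\epsilon_q)^2}}{\bar\epsilon_p^2}=\frac2n\frac{d\bar\epsilon_q}{d\bar\epsilon_p}, \tag{210}$$

$$\frac{\overline{(\epsilon-\bar\epsilon)^2}}{\bar\epsilon_p^2}=\frac2n\frac{d\bar\epsilon}{d\bar\epsilon_p}=\frac2n+\frac2n\frac{d\bar\epsilon_q}{d\bar\epsilon_p}. \tag{211}$$
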

These equations show that when the number of degrees of
freedom of the systems is very great, the mean squares of the
anomalies of the energies (total, potential, and kinetic) are very
small in comparison with the mean square of the kinetic
energy, unless indeed the differential coefficient $d\bar\epsilon_q/d\bar\epsilon_p$ is
of the same order of magnitude as $n$. Such values of $d\bar\epsilon_q/d\bar\epsilon_p$
can only occur within intervals $(\bar\epsilon_p''-\bar\epsilon_p')$ which are of the
order of magnitude of $n^{-1}$, unless it be in cases in which $\bar\epsilon_q$ is in
general of an order of magnitude higher than $\bar\epsilon_p$. Postponing
for the moment the consideration of such cases, it will be
interesting to examine more closely the case of large values of
$d\bar\epsilon_q/d\bar\epsilon_p$ within narrow limits. Let us suppose that for $\bar\epsilon_p'$ and
$\bar\epsilon_p''$ the value of $d\bar\epsilon_q/d\bar\epsilon_p$ is of the order of magnitude of unity,
but between these values of $\bar\epsilon_p$ very great values of the
differential coefficient occur. Then in the ensemble having modulus
$\Theta''$ and average energies $\bar\epsilon_p''$ and $\bar\epsilon_q''$, values of $\epsilon_q$ sensibly greater
than $\bar\epsilon_q''$ will be so rare that we may call them practically
negligible. They will be still more rare in an ensemble of less
modulus. For if we differentiate the equation

$$\eta_q=\frac{\psi_q-\epsilon_q}{\Theta}$$

regarding $\epsilon_q$ as constant, but $\Theta$ and therefore $\psi_q$ as variable,
we get

$$\left(\frac{d\eta_q}{d\Theta}\right)_{\epsilon_q}=\frac1\Theta\frac{d\psi_q}{d\Theta}-\frac{\psi_q-\epsilon_q}{\Theta^2}, \tag{212}$$

whence by (192)

$$\left(\frac{d\eta_q}{d\Theta}\right)_{\epsilon_q}=\frac{\epsilon_q-\bar\epsilon_q}{\Theta^2}. \tag{213}$$

That is, a diminution of the modulus will diminish the probability
of all configurations for which the potential energy exceeds
its average value in the ensemble. Again, in the ensemble
having modulus $\Theta'$ and average energies $\bar\epsilon_p'$ and $\bar\epsilon_q'$, values of
$\epsilon_q$ sensibly less than $\bar\epsilon_q'$ will be so rare as to be practically
negligible. They will be still more rare in an ensemble of greater
modulus, since by the same equation an increase of the
modulus will diminish the probability of configurations for
which the potential energy is less than its average value in
the ensemble. Therefore, for values of $\Theta$ between $\Theta'$ and $\Theta''$,
and of $\bar\epsilon_p$ between $\bar\epsilon_p'$ and $\bar\epsilon_p''$, the individual values of $\epsilon_q$ will
be practically limited to the interval between $\bar\epsilon_q'$ and $\bar\epsilon_q''$.

In the cases which remain to be considered, viz., when
$d\bar\epsilon_q/d\bar\epsilon_p$ has very large values not confined to narrow limits,
and consequently the differences of the mean potential energies
in ensembles of different moduli are in general very large
compared with the differences of the mean kinetic energies, it
appears by (210) that the anomalies of mean square of potential
energy, if not small in comparison with the mean kinetic
energy, will yet in general be very small in comparison with
differences of mean potential energy in ensembles having
moderate differences of mean kinetic energy, the exceptions
being of the same character as described for the case when
$d\bar\epsilon_q/d\bar\epsilon_p$ is not in general large.

It follows that to human experience and observation with
respect to such an ensemble as we are considering, or with
respect to systems which may be regarded as taken at random
from such an ensemble, when the number of degrees of freedom
is of such order of magnitude as the number of molecules
in the bodies subject to our observation and experiment,
$\epsilon-\bar\epsilon$, $\epsilon_p-\bar\epsilon_p$, $\epsilon_q-\bar\epsilon_q$ would be in general vanishing quantities,
since such experience would not be wide enough to embrace
the more considerable divergencies from the mean values, and
such observation not nice enough to distinguish the ordinary
divergencies. In other words, such ensembles would appear
to human observation as ensembles of systems of uniform
energy, and in which the potential and kinetic energies (supposing
that there were means of measuring these quantities
separately) had each separately uniform values.* Exceptions
might occur when for particular values of the modulus the
differential coefficient $d\bar\epsilon_q/d\bar\epsilon_p$ takes a very large value. To
human observation the effect would be, that in ensembles in
which $\Theta$ and $\bar\epsilon_p$ had certain critical values, $\epsilon_q$ would be
indeterminate within certain limits, viz., the values which would
correspond to values of $\Theta$ and $\bar\epsilon_p$ slightly less and slightly
greater than the critical values. Such indeterminateness
corresponds precisely to what we observe in experiments on the
bodies which nature presents to us.†

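* This implies that the kinetic and potential energies of individual systems
would each separately have values sensibly constant in time.

† As an example, we may take a system consisting of a fluid in a cylinder
under a weighted piston, with a vacuum between the piston and the top of
the cylinder, which is closed. The weighted piston is to be regarded as a
part of the system. (This is formally necessary in order to satisfy the
condition of the invariability of the external coordinates.) It is evident that at
a certain temperature, viz., when the pressure of saturated vapor balances
the weight of the piston, there is an indeterminateness in the values of the
potential and total energies as functions of the temperature.

To obtain general formulae for the average values of powers
of the energies, we may proceed as follows. If $h$ is any positive
whole number, we have identically

$$\int^{\text{all}}_{\text{phases}}\cdots\int\epsilon^h\,e^{-\frac{\epsilon}{\Theta}}\,dp_1\ldots dq_n=\Theta^2\frac{d}{d\Theta}\int^{\text{all}}_{\text{phases}}\cdots\int\epsilon^{h-1}\,e^{-\frac{\epsilon}{\Theta}}\,dp_1\ldots dq_n, \tag{214}$$

i.e., by (108),

$$\overline{\epsilon^h}\,e^{-\frac{\psi}{\Theta}}=\Theta^2\frac{d}{d\Theta}\left(\overline{\epsilon^{h-1}}\,e^{-\frac{\psi}{\Theta}}\right). \tag{215}$$

Hence

$$\overline{\epsilon^h}\,e^{-\frac{\psi}{\Theta}}=\left(\Theta^2\frac{d}{d\Theta}\right)^h e^{-\frac{\psi}{\Theta}}, \tag{216}$$

and

$$\overline{\epsilon^h}=e^{\frac{\psi}{\Theta}}\left(\Theta^2\frac{d}{d\Theta}\right)^h e^{-\frac{\psi}{\Theta}}. \tag{217}$$

For $h=1$, this gives

$$\bar\epsilon=-\Theta^2\frac{d}{d\Theta}\left(\frac{\psi}{\Theta}\right), \tag{218}$$

which agrees with (191).
From (215) we have also

$$\overline{\epsilon^h}=\bar\epsilon\,\overline{\epsilon^{h-1}}+\Theta^2\frac{d\,\overline{\epsilon^{h-1}}}{d\Theta}=\left(\bar\epsilon+\Theta^2\frac{d}{d\Theta}\right)\overline{\epsilon^{h-1}}, \tag{219}$$

$$\overline{\epsilon^h}=\left(\bar\epsilon+\Theta^2\frac{d}{d\Theta}\right)^{h-1}\bar\epsilon. \tag{220}$$

In like manner from the identical equation

$$\int^{\text{all}}_{\text{config.}}\cdots\int\epsilon_q^h\,e^{-\frac{\epsilon_q}{\Theta}}\,\Delta_{\dot q}^{\frac12}\,dq_1\ldots dq_n=\Theta^2\frac{d}{d\Theta}\int^{\text{all}}_{\text{config.}}\cdots\int\epsilon_q^{h-1}\,e^{-\frac{\epsilon_q}{\Theta}}\,\Delta_{\dot q}^{\frac12}\,dq_1\ldots dq_n \tag{221}$$

we get

$$\overline{\epsilon_q^h}=e^{\frac{\psi_q}{\Theta}}\left(\Theta^2\frac{d}{d\Theta}\right)^h e^{-\frac{\psi_q}{\Theta}}, \tag{222}$$

and

$$\overline{\epsilon_q^h}=\left(\bar\epsilon_q+\Theta^2\frac{d}{d\Theta}\right)^{h-1}\bar\epsilon_q. \tag{223}$$
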

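With respect to the kinetic energy similar equations will
hold for averages taken for any particular configuration, or
for the whole ensemble. But since

$$\bar\epsilon_p=\frac n2\,\Theta,$$

the equation

$$\overline{\epsilon_p^h}=\left(\bar\epsilon_p+\Theta^2\frac{d}{d\Theta}\right)\overline{\epsilon_p^{h-1}} \tag{224}$$

reduces to

$$\overline{\epsilon_p^h}=\left(\frac n2\,\Theta+\Theta^2\frac{d}{d\Theta}\right)\overline{\epsilon_p^{h-1}}=\left(\frac n2+h-1\right)\Theta\,\overline{\epsilon_p^{h-1}}. \tag{225}$$

We have therefore

$$\overline{\epsilon_p^2}=\left(\frac n2+1\right)\frac n2\,\Theta^2, \tag{226}$$

$$\overline{\epsilon_p^3}=\left(\frac n2+2\right)\left(\frac n2+1\right)\frac n2\,\Theta^3, \tag{227}$$

$$\overline{\epsilon_p^h}=\frac{\Gamma(\frac12 n+h)}{\Gamma(\frac12 n)}\,\Theta^h.* \tag{228}$$
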
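* In the case discussed in the note on page 54 we may easily get

$$\overline{(\epsilon_q-\epsilon_a)^h}=\left(\bar\epsilon_q-\epsilon_a+\Theta^2\frac{d}{d\Theta}\right)\overline{(\epsilon_q-\epsilon_a)^{h-1}},$$

which, with $\bar\epsilon_q-\epsilon_a=\frac n2\Theta$, gives

$$\overline{(\epsilon_q-\epsilon_a)^h}=\left(\frac n2\,\Theta+\Theta^2\frac{d}{d\Theta}\right)\overline{(\epsilon_q-\epsilon_a)^{h-1}}=\left(\frac n2+h-1\right)\Theta\,\overline{(\epsilon_q-\epsilon_a)^{h-1}}.$$

Hence $\overline{(\epsilon_q-\epsilon_a)^h}=\overline{\epsilon_p^h}$. Again

$$\overline{(\epsilon-\epsilon_a)^h}=\left(\bar\epsilon-\epsilon_a+\Theta^2\frac{d}{d\Theta}\right)\overline{(\epsilon-\epsilon_a)^{h-1}},$$

which with $\bar\epsilon-\epsilon_a=n\Theta$ gives

$$\overline{(\epsilon-\epsilon_a)^h}=\left(n\Theta+\Theta^2\frac{d}{d\Theta}\right)\overline{(\epsilon-\epsilon_a)^{h-1}}=(n+h-1)\,\Theta\,\overline{(\epsilon-\epsilon_a)^{h-1}},$$

hence

$$\overline{(\epsilon-\epsilon_a)^h}=\frac{\Gamma(n+h)}{\Gamma(n)}\,\Theta^h.$$

The average values of the powers of the anomalies of the
energies are perhaps most easily found as follows. We have
identically, since $\bar\epsilon$ is a function of $\Theta$, while $\epsilon$ is a function of
the $p$'s and $q$'s,

$$\Theta^2\frac{d}{d\Theta}\int^{\text{all}}_{\text{phases}}\cdots\int(\epsilon-\bar\epsilon)^h\,e^{-\frac{\epsilon}{\Theta}}\,dp_1\ldots dq_n=\int^{\text{all}}_{\text{phases}}\cdots\int\left[\epsilon(\epsilon-\bar\epsilon)^h-h(\epsilon-\bar\epsilon)^{h-1}\Theta^2\frac{d\bar\epsilon}{d\Theta}\right]e^{-\frac{\epsilon}{\Theta}}\,dp_1\ldots dq_n, \tag{229}$$

i.e., by (108),

$$\Theta^2\frac{d}{d\Theta}\left[\overline{(\epsilon-\bar\epsilon)^h}\,e^{-\frac{\psi}{\Theta}}\right]=\left[\overline{\epsilon(\epsilon-\bar\epsilon)^h}-h\,\overline{(\epsilon-\bar\epsilon)^{h-1}}\,\Theta^2\frac{d\bar\epsilon}{d\Theta}\right]e^{-\frac{\psi}{\Theta}}, \tag{230}$$

or since by (218)

$$\Theta^2\frac{d}{d\Theta}\,e^{-\frac{\psi}{\Theta}}=\bar\epsilon\,e^{-\frac{\psi}{\Theta}},$$

$$\Theta^2\frac{d}{d\Theta}\overline{(\epsilon-\bar\epsilon)^h}+\bar\epsilon\,\overline{(\epsilon-\bar\epsilon)^h}=\overline{\epsilon(\epsilon-\bar\epsilon)^h}-h\,\overline{(\epsilon-\bar\epsilon)^{h-1}}\,\Theta^2\frac{d\bar\epsilon}{d\Theta},$$

$$\overline{(\epsilon-\bar\epsilon)^{h+1}}=\Theta^2\frac{d}{d\Theta}\overline{(\epsilon-\bar\epsilon)^h}+h\,\overline{(\epsilon-\bar\epsilon)^{h-1}}\,\Theta^2\frac{d\bar\epsilon}{d\Theta}. \tag{231}$$

In precisely the same way we may obtain for the potential
energy

$$\overline{(\epsilon_q-\bar\epsilon_q)^{h+1}}=\Theta^2\frac{d}{d\Theta}\overline{(\epsilon_q-\bar\epsilon_q)^h}+h\,\overline{(\epsilon_q-\bar\epsilon_q)^{h-1}}\,\Theta^2\frac{d\bar\epsilon_q}{d\Theta}. \tag{232}$$

By successive applications of (231) we obtain

$$\overline{(\epsilon-\bar\epsilon)^2}=D\bar\epsilon,$$

$$\overline{(\epsilon-\bar\epsilon)^3}=D^2\bar\epsilon,$$

$$\overline{(\epsilon-\bar\epsilon)^4}=D^3\bar\epsilon+3(D\bar\epsilon)^2,$$

$$\overline{(\epsilon-\bar\epsilon)^5}=D^4\bar\epsilon+10\,D\bar\epsilon\,D^2\bar\epsilon,$$

$$\overline{(\epsilon-\bar\epsilon)^6}=D^5\bar\epsilon+15\,D\bar\epsilon\,D^3\bar\epsilon+10(D^2\bar\epsilon)^2+15(D\bar\epsilon)^3,\ \text{etc.},$$

where $D$ represents the operator $\Theta^2\,d/d\Theta$. Similar expressions
relating to the potential energy may be derived from
(232).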
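
The recurrence (231) can be exercised on a case where every term is known in closed form. The sketch below assumes, as our own choice of example, that the average energy is proportional to the modulus, $\bar\epsilon=k\Theta$ (for the kinetic energy $k=\frac12n$), so that each average of the form $\overline{(\epsilon-\bar\epsilon)^h}$ is a constant $c_h$ times $\Theta^h$. It then compares the constants given by the recurrence with those obtained by expanding the binomial and inserting $\overline{\epsilon^h}=\Gamma(k+h)\Theta^h/\Gamma(k)$, as in (228). The function names are illustrative.

```python
import math
from fractions import Fraction


def anomaly_coefficients_by_recurrence(k, h_max):
    """c_h with mean((e - ebar)^h) = c_h * Theta^h, from the recurrence (231).

    Assumption: ebar = k * Theta, so D(ebar) = k * Theta^2 and
    D(c_h * Theta^h) = h * c_h * Theta^(h+1), where D = Theta^2 d/dTheta.
    """
    c = [Fraction(1), Fraction(0)]
    for h in range(1, h_max):
        c.append(h * c[h] + h * k * c[h - 1])
    return c[: h_max + 1]


def power_coefficient(k, h):
    """Gamma(k + h) / Gamma(k), written exactly as the product k(k+1)...(k+h-1)."""
    out = Fraction(1)
    for j in range(h):
        out *= k + j
    return out


def anomaly_coefficients_by_expansion(k, h_max):
    """The same c_h from the binomial expansion of (e - k*Theta)^h."""
    return [
        sum(
            math.comb(h, j) * power_coefficient(k, j) * (-k) ** (h - j)
            for j in range(h + 1)
        )
        for h in range(h_max + 1)
    ]
```

With `k = Fraction(3, 2)`, that is $n=3$, the two lists agree term by term, and `c[2] / k**2` equals $2/n$, in agreement with (209).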

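For the kinetic energy we may write similar equations in
which the averages may be taken either for a single configuration
or for the whole ensemble. But since

$$\frac{d\bar\epsilon_p}{d\Theta}=\frac n2,$$

the general formula reduces to

$$\overline{(\epsilon_p-\bar\epsilon_p)^{h+1}}=\Theta^2\frac{d}{d\Theta}\overline{(\epsilon_p-\bar\epsilon_p)^h}+\tfrac12 nh\,\Theta^2\,\overline{(\epsilon_p-\bar\epsilon_p)^{h-1}} \tag{233}$$

or

$$\frac{\overline{(\epsilon_p-\bar\epsilon_p)^{h+1}}}{\bar\epsilon_p^{\,h+1}}=\frac{2\Theta}{n}\frac{d}{d\Theta}\frac{\overline{(\epsilon_p-\bar\epsilon_p)^h}}{\bar\epsilon_p^{\,h}}+\frac{2h}{n}\frac{\overline{(\epsilon_p-\bar\epsilon_p)^h}}{\bar\epsilon_p^{\,h}}+\frac{2h}{n}\frac{\overline{(\epsilon_p-\bar\epsilon_p)^{h-1}}}{\bar\epsilon_p^{\,h-1}}. \tag{234}$$

But since identically

$$\frac{\overline{(\epsilon_p-\bar\epsilon_p)^0}}{\bar\epsilon_p^{\,0}}=1,\qquad\frac{\overline{(\epsilon_p-\bar\epsilon_p)^1}}{\bar\epsilon_p^{\,1}}=0,$$

the value of the corresponding expression for any index will
be independent of $\Theta$ and the formula reduces to

$$\overline{\left(\frac{\epsilon_p-\bar\epsilon_p}{\bar\epsilon_p}\right)^{h+1}}=\frac{2h}{n}\,\overline{\left(\frac{\epsilon_p-\bar\epsilon_p}{\bar\epsilon_p}\right)^{h}}+\frac{2h}{n}\,\overline{\left(\frac{\epsilon_p-\bar\epsilon_p}{\bar\epsilon_p}\right)^{h-1}}; \tag{235}$$

we have therefore

$$\overline{\left(\frac{\epsilon_p-\bar\epsilon_p}{\bar\epsilon_p}\right)^2}=\frac2n,\qquad\overline{\left(\frac{\epsilon_p-\bar\epsilon_p}{\bar\epsilon_p}\right)^3}=\frac{8}{n^2},\qquad\overline{\left(\frac{\epsilon_p-\bar\epsilon_p}{\bar\epsilon_p}\right)^4}=\frac{48}{n^3}+\frac{12}{n^2},$$

etc.*

* In the case discussed in the preceding foot-notes we get easily

$$\overline{(\epsilon_q-\bar\epsilon_q)^h}=\overline{(\epsilon_p-\bar\epsilon_p)^h},$$

and

$$\overline{\left(\frac{\epsilon_q-\bar\epsilon_q}{\bar\epsilon_q-\epsilon_a}\right)^h}=\overline{\left(\frac{\epsilon_p-\bar\epsilon_p}{\bar\epsilon_p}\right)^h}.$$

For the total energy we have in this case

$$\overline{\left(\frac{\epsilon-\bar\epsilon}{\bar\epsilon-\epsilon_a}\right)^{h+1}}=\frac hn\,\overline{\left(\frac{\epsilon-\bar\epsilon}{\bar\epsilon-\epsilon_a}\right)^{h}}+\frac hn\,\overline{\left(\frac{\epsilon-\bar\epsilon}{\bar\epsilon-\epsilon_a}\right)^{h-1}},$$

$$\overline{\left(\frac{\epsilon-\bar\epsilon}{\bar\epsilon-\epsilon_a}\right)^2}=\frac1n,\qquad\overline{\left(\frac{\epsilon-\bar\epsilon}{\bar\epsilon-\epsilon_a}\right)^3}=\frac{2}{n^2},\qquad\overline{\left(\frac{\epsilon-\bar\epsilon}{\bar\epsilon-\epsilon_a}\right)^4}=\frac{6}{n^3}+\frac{3}{n^2},$$

etc.

It will be observed that when $\psi$ or $\bar\epsilon$ is given as function of
$\Theta$, all averages of the form $\overline{\epsilon^h}$ or $\overline{(\epsilon-\bar\epsilon)^h}$ are thereby
determined. So also if $\psi_q$ or $\bar\epsilon_q$ is given as function of $\Theta$, all
averages of the form $\overline{\epsilon_q^h}$ or $\overline{(\epsilon_q-\bar\epsilon_q)^h}$ are determined. But

$$\bar\epsilon_q=\bar\epsilon-\tfrac12 n\Theta.$$

Therefore if any one of the quantities $\psi$, $\psi_q$, $\bar\epsilon$, $\bar\epsilon_q$ is known
as function of $\Theta$, and $n$ is also known, all averages of any of
the forms mentioned are thereby determined as functions of
the same variable. In any case all averages of the form

$$\overline{\left(\frac{\epsilon_p-\bar\epsilon_p}{\bar\epsilon_p}\right)^h}$$

are known in terms of $n$ alone, and have the same value
whether taken for the whole ensemble or limited to any
particular configuration.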

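If we differentiate the equation

$$\int\cdots\int_{\text{all phases}} e^{\frac{\psi-\epsilon}{\Theta}}\,dp_1\ldots dq_n = 1 \tag{236}$$

with respect to $a_1$, and multiply by $\Theta$, we have

$$\int\cdots\int_{\text{all phases}}\left(\frac{d\psi}{da_1}-\frac{d\epsilon}{da_1}\right)e^{\frac{\psi-\epsilon}{\Theta}}\,dp_1\ldots dq_n = 0. \tag{237}$$

Differentiating again, with respect to $a_1$, with respect to $a_2$,
and with respect to $\Theta$, we have

$$\int\cdots\int_{\text{all phases}}\left[\frac{d^2\psi}{da_1^2}-\frac{d^2\epsilon}{da_1^2}+\frac{1}{\Theta}\left(\frac{d\psi}{da_1}-\frac{d\epsilon}{da_1}\right)^{2}\right]e^{\frac{\psi-\epsilon}{\Theta}}\,dp_1\ldots dq_n = 0, \tag{238}$$

$$\int\cdots\int_{\text{all phases}}\left[\frac{d^2\psi}{da_1\,da_2}-\frac{d^2\epsilon}{da_1\,da_2}+\frac{1}{\Theta}\left(\frac{d\psi}{da_1}-\frac{d\epsilon}{da_1}\right)\left(\frac{d\psi}{da_2}-\frac{d\epsilon}{da_2}\right)\right]e^{\frac{\psi-\epsilon}{\Theta}}\,dp_1\ldots dq_n = 0, \tag{239}$$

$$\int\cdots\int_{\text{all phases}}\left[\frac{d^2\psi}{da_1\,d\Theta}+\left(\frac{d\psi}{da_1}-\frac{d\epsilon}{da_1}\right)\left(\frac{1}{\Theta}\frac{d\psi}{d\Theta}-\frac{\psi-\epsilon}{\Theta^2}\right)\right]e^{\frac{\psi-\epsilon}{\Theta}}\,dp_1\ldots dq_n = 0. \tag{240}$$

The multiple integrals in the last four equations represent the
average values of the expressions in the brackets, which we
may therefore set equal to zero. The first gives

$$\frac{d\psi}{da_1} = \overline{\frac{d\epsilon}{da_1}} = -\bar A_1, \tag{241}$$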
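as already obtained. With this relation and (191) we get
from the other equations

$$\overline{(A_1-\bar A_1)^2} = \Theta\left(\overline{\frac{d^2\epsilon}{da_1^2}}-\frac{d^2\psi}{da_1^2}\right) = \Theta\left(\frac{d\bar A_1}{da_1}-\overline{\frac{dA_1}{da_1}}\right), \tag{242}$$

$$\overline{(A_1-\bar A_1)(A_2-\bar A_2)} = \Theta\left(\overline{\frac{d^2\epsilon}{da_1\,da_2}}-\frac{d^2\psi}{da_1\,da_2}\right) = \Theta\left(\frac{d\bar A_1}{da_2}-\overline{\frac{dA_1}{da_2}}\right),$$

$$\overline{(A_1-\bar A_1)(\epsilon-\bar\epsilon)} = -\Theta^2\frac{d^2\psi}{da_1\,d\Theta} = \Theta^2\frac{d\bar A_1}{d\Theta}.$$

We may add for comparison equation (205), which might be
derived from (236) by differentiating twice with respect to $\Theta$:

$$\overline{(\epsilon-\bar\epsilon)^2} = -\Theta^3\frac{d^2\psi}{d\Theta^2} = \Theta^2\frac{d\bar\epsilon}{d\Theta}. \tag{244}$$

The two last equations give

$$\overline{(A_1-\bar A_1)(\epsilon-\bar\epsilon)} = \frac{d\bar A_1}{d\bar\epsilon}\,\overline{(\epsilon-\bar\epsilon)^2}. \tag{245}$$

If $\psi$ or $\bar\epsilon$ is known as function of $\Theta$, $a_1$, $a_2$, etc., $\overline{(\epsilon-\bar\epsilon)^2}$ may
be obtained by differentiation as function of the same variables.
And if $\psi$, or $\bar A_1$, or $\bar\eta$ is known as function of $\Theta$, $a_1$, etc.,
$\overline{(A_1-\bar A_1)(\epsilon-\bar\epsilon)}$ may be obtained by differentiation. But
$\overline{(A_1-\bar A_1)^2}$ and $\overline{(A_1-\bar A_1)(A_2-\bar A_2)}$ cannot be obtained in any
similar manner. We have seen that $\overline{(\epsilon-\bar\epsilon)^2}$ is in general a
vanishing quantity for very great values of $n$, which we may
regard as contained implicitly in $\Theta$ as a divisor. The same is
true of $\overline{(A_1-\bar A_1)(\epsilon-\bar\epsilon)}$. It does not appear that we can
assert the same of $\overline{(A_1-\bar A_1)^2}$ or $\overline{(A_1-\bar A_1)(A_2-\bar A_2)}$, since
$d^2\epsilon/da_1^2$ may be very great. The quantities $d^2\epsilon/da_1^2$ and $d^2\psi/da_1^2$
belong to the class called elasticities. The former expression
represents an elasticity measured under the condition that
while $a_1$ is varied the internal coordinates $q_1, \ldots q_n$ all remain
fixed. The latter is an elasticity measured under the condition
that when $a_1$ is varied the ensemble remains canonically
distributed within the same modulus. This corresponds to
an elasticity in physics measured under the condition of constant
temperature. It is evident that the former is greater
than the latter, and it may be enormously greater.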

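The divergences of the force $A_1$ from its average value are
due in part to the differences of energy in the systems of the
ensemble, and in part to the differences in the value of
the forces which exist in systems of the same energy. If we
write $\overline{A_1}\big|_\epsilon$ for the average value of $A_1$ in systems of the
ensemble which have any same energy, it will be determined
by the equation

$$\overline{A_1}\big|_\epsilon = \frac{\displaystyle\int\cdots\int -\frac{d\epsilon}{da_1}\,e^{\frac{\psi-\epsilon}{\Theta}}\,dp_1\ldots dq_n}{\displaystyle\int\cdots\int e^{\frac{\psi-\epsilon}{\Theta}}\,dp_1\ldots dq_n} \tag{246}$$

where the limits of integration in both multiple integrals are
two values of the energy which differ infinitely little, say $\epsilon$ and
$\epsilon + d\epsilon$. This will make the factor $e^{\frac{\psi-\epsilon}{\Theta}}$ constant within the
limits of integration, and it may be cancelled in the numerator
and denominator, leaving

$$\overline{A_1}\big|_\epsilon = \frac{\displaystyle\int\cdots\int -\frac{d\epsilon}{da_1}\,dp_1\ldots dq_n}{\displaystyle\int\cdots\int dp_1\ldots dq_n} \tag{247}$$

where the integrals as before are to be taken between $\epsilon$ and
$\epsilon + d\epsilon$. $\overline{A_1}\big|_\epsilon$ is therefore independent of $\Theta$, being a function
of the energy and the external coordinates.

Now we have identically

$$A_1 - \bar A_1 = \left(A_1 - \overline{A_1}\big|_\epsilon\right) + \left(\overline{A_1}\big|_\epsilon - \bar A_1\right),$$

where $A_1 - \overline{A_1}\big|_\epsilon$ denotes the excess of the force (tending to
increase $a_1$) exerted by any system above the average of such
forces for systems of the same energy. Accordingly,

$$\overline{(A_1-\bar A_1)^2} = \overline{\left(A_1-\overline{A_1}\big|_\epsilon\right)^2} + 2\,\overline{\left(A_1-\overline{A_1}\big|_\epsilon\right)\left(\overline{A_1}\big|_\epsilon-\bar A_1\right)} + \overline{\left(\overline{A_1}\big|_\epsilon-\bar A_1\right)^2}.$$

But the average value of $\left(A_1-\overline{A_1}\big|_\epsilon\right)\left(\overline{A_1}\big|_\epsilon-\bar A_1\right)$ for systems
of the ensemble which have the same energy is zero, since for
such systems the second factor is constant. Therefore the
average for the whole ensemble is zero, and

$$\overline{(A_1-\bar A_1)^2} = \overline{\left(A_1-\overline{A_1}\big|_\epsilon\right)^2} + \overline{\left(\overline{A_1}\big|_\epsilon-\bar A_1\right)^2}. \tag{248}$$

In the same way it may be shown that

$$\overline{(A_1-\bar A_1)(\epsilon-\bar\epsilon)} = \overline{\left(\overline{A_1}\big|_\epsilon-\bar A_1\right)(\epsilon-\bar\epsilon)}. \tag{249}$$

It is evident that in ensembles in which the anomalies of
energy $\epsilon - \bar\epsilon$ may be regarded as insensible the same will be
true of the quantities represented by $\overline{A_1}\big|_\epsilon - \bar A_1$.

The properties of quantities of the form $\overline{A_1}\big|_\epsilon$ will be
farther considered in Chapter X, which will be devoted to
ensembles of constant energy.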

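It may not be without interest to consider some general
formulae relating to averages in a canonical ensemble, which
embrace many of the results which have been given in this
chapter.

Let $u$ be any function of the internal and external coordinates
with the momenta and modulus. We have by definition

$$\bar u = \int\cdots\int_{\text{all phases}} u\,e^{\frac{\psi-\epsilon}{\Theta}}\,dp_1\ldots dq_n. \tag{250}$$

If we differentiate with respect to $\Theta$, we have

$$\frac{d\bar u}{d\Theta} = \int\cdots\int_{\text{all phases}}\left(\frac{du}{d\Theta}-\frac{u}{\Theta^2}(\psi-\epsilon)+\frac{u}{\Theta}\frac{d\psi}{d\Theta}\right)e^{\frac{\psi-\epsilon}{\Theta}}\,dp_1\ldots dq_n,$$

or

$$\frac{d\bar u}{d\Theta} = \overline{\frac{du}{d\Theta}} - \frac{\overline{u(\psi-\epsilon)}}{\Theta^2} + \frac{\bar u}{\Theta}\frac{d\psi}{d\Theta}.$$

Setting $u = 1$ in this equation, we get

$$\frac{d\psi}{d\Theta} = \frac{\psi-\bar\epsilon}{\Theta},$$

and substituting this value, we have

$$\frac{d\bar u}{d\Theta} = \overline{\frac{du}{d\Theta}} + \frac{\overline{u\epsilon}}{\Theta^2} - \frac{\bar u\,\bar\epsilon}{\Theta^2}, \tag{251}$$

or

$$\Theta^2\frac{d\bar u}{d\Theta} - \Theta^2\,\overline{\frac{du}{d\Theta}} = \overline{u\epsilon} - \bar u\,\bar\epsilon = \overline{(u-\bar u)(\epsilon-\bar\epsilon)}. \tag{252}$$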
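Equation (252) lends itself to a quick numerical check. The sketch below is an illustration only: it replaces the integral over phases by a sum over a few hypothetical weighted energy levels (an assumption made for brevity, since the identity holds for any measure), takes $u$ to be a function of the energy alone so that $\overline{du/d\Theta} = 0$, and estimates $d\bar u/d\Theta$ by a central difference.

```python
import math

LEVELS = [(0.0, 1), (1.0, 3), (2.5, 2), (4.0, 5)]   # hypothetical (energy, weight) pairs

def average(f, theta):
    """Canonical average of f(eps) over the toy levels at modulus theta."""
    weights = [g * math.exp(-eps / theta) for eps, g in LEVELS]
    z = sum(weights)
    return sum(w * f(eps) for w, (eps, _) in zip(weights, LEVELS)) / z

def check_252(u, theta, step=1e-5):
    """Return (Theta^2 d(mean u)/dTheta, mean of (u - mean u)(eps - mean eps))."""
    slope = (average(u, theta + step) - average(u, theta - step)) / (2 * step)
    u_bar = average(u, theta)
    e_bar = average(lambda e: e, theta)
    covariance = average(lambda e: (u(e) - u_bar) * (e - e_bar), theta)
    return theta ** 2 * slope, covariance
```

With $u = \epsilon$ the same routine reproduces the fluctuation formula (244) as a special case.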


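If we differentiate equation (250) with respect to $a$ (which
may represent any of the external coordinates), and write $A$
for the force $-d\epsilon/da$, we get

$$\frac{d\bar u}{da} = \int\cdots\int_{\text{all phases}}\left(\frac{du}{da}+\frac{u}{\Theta}\frac{d\psi}{da}+\frac{u}{\Theta}A\right)e^{\frac{\psi-\epsilon}{\Theta}}\,dp_1\ldots dq_n,$$

or

$$\frac{d\bar u}{da} = \overline{\frac{du}{da}} + \frac{\bar u}{\Theta}\frac{d\psi}{da} + \frac{\overline{uA}}{\Theta}. \tag{253}$$

Setting $u = 1$ in this equation, we get

$$\frac{d\psi}{da} = -\bar A.$$

Substituting this value, we have

$$\frac{d\bar u}{da} = \overline{\frac{du}{da}} + \frac{\overline{uA}}{\Theta} - \frac{\bar u\,\bar A}{\Theta}, \tag{254}$$

or

$$\Theta\frac{d\bar u}{da} - \Theta\,\overline{\frac{du}{da}} = \overline{uA} - \bar u\,\bar A = \overline{(u-\bar u)(A-\bar A)}. \tag{255}$$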

Repeated applications of the principles expressed by equations
(252) and (255) are perhaps best made in the particular
cases. Yet we may write (252) in this form

$$\overline{(\epsilon + D)(u-\bar u)} = 0, \tag{256}$$

where $D$ represents the operator $\Theta^2\,d/d\Theta$.

Hence

$$\overline{(\epsilon + D)^h(u-\bar u)} = 0, \tag{257}$$

where $h$ is any positive whole number. It will be observed,
that since $\epsilon$ is not function of $\Theta$, $(\epsilon + D)^h$ may be expanded by
the binomial theorem. Or, we may write

$$\overline{(\epsilon + D)\,u} = (\bar\epsilon + D)\,\bar u, \tag{258}$$

whence

$$\overline{(\epsilon + D)^h\,u} = (\bar\epsilon + D)^h\,\bar u. \tag{259}$$

But the operator $(\bar\epsilon + D)^h$, although in some respects more
simple than the operator without the average sign on the $\epsilon$,
cannot be expanded by the binomial theorem, since $\bar\epsilon$ is a
function of $\Theta$ with the external coordinates.

So from equation (254) we have

$$\overline{\left(\frac{A}{\Theta}+\frac{d}{da}\right)(u-\bar u)} = 0, \tag{260}$$

whence

$$\overline{\left(\frac{A}{\Theta}+\frac{d}{da}\right)^{h}(u-\bar u)} = 0; \tag{261}$$

and

$$\overline{\left(\frac{A}{\Theta}+\frac{d}{da}\right)u} = \left(\frac{\bar A}{\Theta}+\frac{d}{da}\right)\bar u, \tag{262}$$

whence

$$\overline{\left(\frac{A}{\Theta}+\frac{d}{da}\right)^{h}u} = \left(\frac{\bar A}{\Theta}+\frac{d}{da}\right)^{h}\bar u. \tag{263}$$

The binomial theorem cannot be applied to these operators.

Again, if we now distinguish, as usual, the several external
coordinates by suffixes, we may apply successively to the
expression $u - \bar u$ any or all of the operators

$$\epsilon + \Theta^2\frac{d}{d\Theta}, \qquad \frac{A_1}{\Theta}+\frac{d}{da_1}, \qquad \frac{A_2}{\Theta}+\frac{d}{da_2}, \qquad\text{etc.} \tag{264}$$

as many times as we choose, and in any order, the average
value of the result will be zero. Or, if we apply the same
operators to $u$, and finally take the average value, it will be the
same as the value obtained by writing the sign of average
separately on $u$, and on $\epsilon$, $A_1$, $A_2$, etc., in all the operators.

If $u$ is independent of the momenta, formulae similar to
the preceding, but having $\epsilon_q$ in place of $\epsilon$, may be derived
from equation (179).



CHAPTER VIII. 


ON CERTAIN IMPORTANT FUNCTIONS OF THE 
ENERGIES OF A SYSTEM. 

In order to consider more particularly the distribution of a 
canonical ensemble in energy, and for other purposes, it will 
be convenient to use the following definitions and notations. 

Let us denote by $V$ the extension-in-phase below a certain
limit of energy which we shall call $\epsilon$. That is, let

$$V = \int\cdots\int dp_1\ldots dq_n, \tag{265}$$

the integration being extended (with constant values of the
external coordinates) over all phases for which the energy is
less than the limit $\epsilon$. We shall suppose that the value of this
integral is not infinite, except for an infinite value of the limiting
energy. This will not exclude any kind of system to
which the canonical distribution is applicable. For if

$$\int\cdots\int e^{-\frac{\epsilon}{\Theta}}\,dp_1\ldots dq_n$$

taken without limits has a finite value,* the less value represented by

$$e^{-\frac{\epsilon}{\Theta}}\int\cdots\int dp_1\ldots dq_n$$

taken below a limiting value of $\epsilon$, and with the $\epsilon$ before the
integral sign representing that limiting value, will also be
finite. Therefore the value of $V$, which differs only by a
constant factor, will also be finite, for finite $\epsilon$. It is a function
of $\epsilon$ and the external coordinates, a continuous increasing
function of $\epsilon$, which becomes infinite with $\epsilon$, and vanishes
for the smallest possible value of $\epsilon$, or for $\epsilon = -\infty$, if the
energy may be diminished without limit.

* This is a necessary condition of the canonical distribution. See
Chapter IV, p. 36.

Let us also set

$$\phi = \log\frac{dV}{d\epsilon}. \tag{266}$$

The extension in phase between any two limits of energy, $\epsilon'$
and $\epsilon''$, will be represented by the integral

$$\int_{\epsilon'}^{\epsilon''} e^{\phi}\,d\epsilon. \tag{267}$$

And in general, we may substitute $e^{\phi}\,d\epsilon$ for $dp_1\ldots dq_n$ in a
$2n$-fold integral, reducing it to a simple integral, whenever
the limits can be expressed by the energy alone, and the other
factor under the integral sign is a function of the energy alone,
or with quantities which are constant in the integration.

In particular we observe that the probability that the energy
of an unspecified system of a canonical ensemble lies between
the limits $\epsilon'$ and $\epsilon''$ will be represented by the integral *

$$\int_{\epsilon'}^{\epsilon''} e^{\frac{\psi-\epsilon}{\Theta}+\phi}\,d\epsilon, \tag{268}$$

and that the average value in the ensemble of any quantity
which only varies with the energy is given by the equation †

$$\bar u = \int_{V=0}^{\epsilon=\infty} u\,e^{\frac{\psi-\epsilon}{\Theta}+\phi}\,d\epsilon, \tag{269}$$

where we may regard the constant $\psi$ as determined by the
equation ‡

$$e^{-\frac{\psi}{\Theta}} = \int_{V=0}^{\epsilon=\infty} e^{-\frac{\epsilon}{\Theta}+\phi}\,d\epsilon. \tag{270}$$

In regard to the lower limit in these integrals, it will be observed
that $V = 0$ is equivalent to the condition that the
value of $\epsilon$ is the least possible.

* Compare equation (93). † Compare equation (108).
‡ Compare equation (92).

In like manner, let us denote by $V_q$ the extension-in-configuration
below a certain limit of potential energy which we may
call $\epsilon_q$. That is, let

$$V_q = \int\cdots\int \Delta_{\dot q}^{\frac12}\,dq_1\ldots dq_n, \tag{271}$$

the integration being extended (with constant values of the
external coordinates) over all configurations for which the
potential energy is less than $\epsilon_q$. $V_q$ will be a function of $\epsilon_q$
with the external coordinates, an increasing function of $\epsilon_q$,
which does not become infinite (in such cases as we shall consider*)
for any finite value of $\epsilon_q$. It vanishes for the least
possible value of $\epsilon_q$, or for $\epsilon_q = -\infty$, if $\epsilon_q$ can be diminished
without limit. It is not always a continuous function of $\epsilon_q$.
In fact, if there is a finite extension-in-configuration of constant
potential energy, the corresponding value of $V_q$ will
not include that extension-in-configuration, but if $\epsilon_q$ be increased
infinitesimally, the corresponding value of $V_q$ will be
increased by that finite extension-in-configuration.

Let us also set

$$\phi_q = \log\frac{dV_q}{d\epsilon_q}. \tag{272}$$

The extension-in-configuration between any two limits of
potential energy $\epsilon_q'$ and $\epsilon_q''$ may be represented by the integral

$$\int_{\epsilon_q'}^{\epsilon_q''} e^{\phi_q}\,d\epsilon_q \tag{273}$$

whenever there is no discontinuity in the value of $V_q$ as
function of $\epsilon_q$ between or at those limits, that is, whenever
there is no finite extension-in-configuration of constant
potential energy between or at the limits. And in general,
with the restriction mentioned, we may substitute $e^{\phi_q}\,d\epsilon_q$ for
$\Delta_{\dot q}^{\frac12}\,dq_1\ldots dq_n$ in an $n$-fold integral, reducing it to a simple
integral, when the limits are expressed by the potential energy,
and the other factor under the integral sign is a function of
the potential energy, either alone or with quantities which are
constant in the integration.

* If $V_q$ were infinite for finite values of $\epsilon_q$, $V$ would evidently be infinite
for finite values of $\epsilon$.

We may often avoid the inconvenience occasioned by formulae
becoming illusory on account of discontinuities in the
values of $V_q$ as function of $\epsilon_q$ by substituting for the given
discontinuous function a continuous function which is practically
equivalent to the given function for the purposes of the
evaluations desired. It only requires infinitesimal changes of
potential energy to destroy the finite extensions-in-configuration
of constant potential energy which are the cause of the
difficulty.

In the case of an ensemble of systems canonically distributed
in configuration, when $V_q$ is, or may be regarded as, a continuous
function of $\epsilon_q$ (within the limits considered), the probability
that the potential energy of an unspecified system lies
between the limits $\epsilon_q'$ and $\epsilon_q''$ is given by the integral

$$\int_{\epsilon_q'}^{\epsilon_q''} e^{\frac{\psi_q-\epsilon_q}{\Theta}+\phi_q}\,d\epsilon_q, \tag{274}$$

where $\psi_q$ may be determined by the condition that the value of
the integral is unity, when the limits include all possible
values of $\epsilon_q$. In the same case, the average value in the ensemble
of any function of the potential energy is given by the
equation

$$\bar u = \int_{V_q=0}^{\epsilon_q=\infty} u\,e^{\frac{\psi_q-\epsilon_q}{\Theta}+\phi_q}\,d\epsilon_q. \tag{275}$$

When $V_q$ is not a continuous function of $\epsilon_q$, we may write $dV_q$
for $e^{\phi_q}\,d\epsilon_q$ in these formulae.

In like manner also, for any given configuration, let us
denote by $V_p$ the extension-in-velocity below a certain limit of
kinetic energy specified by $\epsilon_p$. That is, let

$$V_p = \int\cdots\int \Delta_p^{\frac12}\,dp_1\ldots dp_n, \tag{276}$$

the integration being extended, with constant values of the
coordinates, both internal and external, over all values of the
momenta for which the kinetic energy is less than the limit $\epsilon_p$.
$V_p$ will evidently be a continuous increasing function of $\epsilon_p$
which vanishes and becomes infinite with $\epsilon_p$. Let us set

$$\phi_p = \log\frac{dV_p}{d\epsilon_p}. \tag{277}$$

The extension-in-velocity between any two limits of kinetic
energy $\epsilon_p'$ and $\epsilon_p''$ may be represented by the integral

$$\int_{\epsilon_p'}^{\epsilon_p''} e^{\phi_p}\,d\epsilon_p. \tag{278}$$

And in general, we may substitute $e^{\phi_p}\,d\epsilon_p$ for $\Delta_p^{\frac12}\,dp_1\ldots dp_n$
or $\Delta_{\dot q}^{\frac12}\,d\dot q_1\ldots d\dot q_n$ in an $n$-fold integral in which the coordinates
are constant, reducing it to a simple integral, when the
limits are expressed by the kinetic energy, and the other factor
under the integral sign is a function of the kinetic energy,
either alone or with quantities which are constant in the
integration.

It is easy to express $V_p$ and $\phi_p$ in terms of $\epsilon_p$. Since $\Delta_p$ is
function of the coordinates alone, we have by definition

$$V_p = \Delta_p^{\frac12}\int\cdots\int dp_1\ldots dp_n \tag{279}$$

the limits of the integral being given by $\epsilon_p$. That is, if

$$\epsilon_p = F(p_1, \ldots p_n), \tag{280}$$

the limits of the integral for $\epsilon_p = 1$ are given by the equation

$$F(p_1, \ldots p_n) = 1, \tag{281}$$

and the limits of the integral for $\epsilon_p = a^2$ are given by the
equation

$$F(p_1, \ldots p_n) = a^2. \tag{282}$$


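But since $F$ represents a quadratic function, this equation
may be written

$$F\left(\frac{p_1}{a}, \ldots \frac{p_n}{a}\right) = 1. \tag{283}$$

The value of $V_p$ may also be put in the form

$$V_p = a^n\,\Delta_p^{\frac12}\int\cdots\int d\frac{p_1}{a}\ldots d\frac{p_n}{a}. \tag{284}$$

Now we may determine $V_p$ for $\epsilon_p = 1$ from (279) where the
limits are expressed by (281), and $V_p$ for $\epsilon_p = a^2$ from (284)
taking the limits from (283). The two integrals thus determined
are evidently identical, and we have

$$(V_p)_{\epsilon_p = a^2} = a^n\,(V_p)_{\epsilon_p = 1}, \tag{285}$$

i.e., $V_p$ varies as $\epsilon_p^{\frac n2}$. We may therefore set

$$V_p = C\,\epsilon_p^{\frac n2}, \qquad e^{\phi_p} = \frac n2\,C\,\epsilon_p^{\frac n2 - 1}, \tag{286}$$

where $C$ is a constant, at least for fixed values of the internal
coordinates.

To determine this constant, let us consider the case of a
canonical distribution, for which we have

$$\int_0^\infty e^{\frac{\psi_p-\epsilon_p}{\Theta}+\phi_p}\,d\epsilon_p = 1,$$

where

$$e^{-\frac{\psi_p}{\Theta}} = (2\pi\Theta)^{\frac n2}.$$

Substituting this value, and that of $e^{\phi_p}$ from (286), we get

$$\frac n2\,C\int_0^\infty e^{-\frac{\epsilon_p}{\Theta}}\,\epsilon_p^{\frac n2 - 1}\,d\epsilon_p = (2\pi\Theta)^{\frac n2},$$

$$\frac n2\,C\int_0^\infty e^{-\frac{\epsilon_p}{\Theta}}\left(\frac{\epsilon_p}{\Theta}\right)^{\frac n2 - 1}d\left(\frac{\epsilon_p}{\Theta}\right) = (2\pi)^{\frac n2},$$

$$\frac n2\,C\,\Gamma(\tfrac12 n) = (2\pi)^{\frac n2},$$

$$C = \frac{(2\pi)^{\frac n2}}{\Gamma(\tfrac12 n + 1)}. \tag{287}$$

Having thus determined the value of the constant $C$, we may
substitute it in the general expressions (286), and obtain the
following values, which are perfectly general:

$$V_p = \frac{(2\pi\epsilon_p)^{\frac n2}}{\Gamma(\tfrac12 n + 1)}, \tag{288}$$

$$e^{\phi_p} = \frac{(2\pi)^{\frac n2}\,\epsilon_p^{\frac n2 - 1}}{\Gamma(\tfrac12 n)}. \text{*} \tag{289}$$

It will be observed that the values of $V_p$ and $\phi_p$ for any
given $\epsilon_p$ are independent of the configuration, and even of the
nature of the system considered, except with respect to its
number of degrees of freedom.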
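The general values of $V_p$ and $e^{\phi_p}$ can be compared with elementary geometry. The sketch below is an illustration, reading (288) and (289) as $V_p = (2\pi\epsilon_p)^{n/2}/\Gamma(\frac12 n+1)$ and $e^{\phi_p} = (2\pi)^{n/2}\epsilon_p^{n/2-1}/\Gamma(\frac12 n)$. For particles of unit mass (an assumption made only for the comparison) the region $\epsilon_p < \text{limit}$ is a sphere of radius $\sqrt{2\epsilon_p}$ in momentum space, whose length, area and volume for $n = 1, 2, 3$ are familiar. The function names are hypothetical.

```python
import math

def V_p(eps_p, n):
    """Extension-in-velocity below kinetic energy eps_p, as in (288)."""
    return (2 * math.pi * eps_p) ** (n / 2) / math.gamma(n / 2 + 1)

def exp_phi_p(eps_p, n):
    """Derivative dV_p/d(eps_p), as in (289)."""
    return (2 * math.pi) ** (n / 2) * eps_p ** (n / 2 - 1) / math.gamma(n / 2)

def sphere_content(radius, n):
    """Length, area or volume of the n-sphere for n = 1, 2, 3 (elementary formulas)."""
    return {1: 2 * radius,
            2: math.pi * radius ** 2,
            3: 4 / 3 * math.pi * radius ** 3}[n]
```

Only $n$ enters besides the kinetic energy, in agreement with the remark that the result is independent of the configuration and of the nature of the system.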

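Returning to the canonical ensemble, we may express the
probability that the kinetic energy of a system of a given
configuration, but otherwise unspecified, falls within given
limits, by either member of the following equation

$$\int e^{\frac{\psi_p-\epsilon_p}{\Theta}+\phi_p}\,d\epsilon_p = \frac{1}{\Gamma(\tfrac12 n)}\int e^{-\frac{\epsilon_p}{\Theta}}\left(\frac{\epsilon_p}{\Theta}\right)^{\frac n2 - 1}d\left(\frac{\epsilon_p}{\Theta}\right). \tag{290}$$

Since this value is independent of the coordinates it also
represents the probability that the kinetic energy of an
unspecified system of a canonical ensemble falls within the
limits. The form of the last integral also shows that the probability
that the ratio of the kinetic energy to the modulus
falls within given limits is independent also of the value of
the modulus, being determined entirely by the number of
degrees of freedom of the system and the limiting values
of the ratio.

* Very similar values for $V_q$, $e^{\phi_q}$, $V$, and $e^{\phi}$ may be found in the same
way in the case discussed in the preceding foot-notes (see pages 54, 72, 77, and
79), in which $\epsilon_q$ is a quadratic function of the $q$'s, and $\Delta_{\dot q}$ independent of the
$q$'s. In this case we have

$$V_q = \left(\frac{\Delta_{\dot q}}{\Delta_q}\right)^{\frac12}\frac{(2\pi)^{\frac n2}(\epsilon_q-\epsilon_a)^{\frac n2}}{\Gamma(\tfrac12 n + 1)}, \qquad e^{\phi_q} = \left(\frac{\Delta_{\dot q}}{\Delta_q}\right)^{\frac12}\frac{(2\pi)^{\frac n2}(\epsilon_q-\epsilon_a)^{\frac n2 - 1}}{\Gamma(\tfrac12 n)},$$

$$V = \left(\frac{\Delta_{\dot q}}{\Delta_q}\right)^{\frac12}\frac{(2\pi)^{n}(\epsilon-\epsilon_a)^{n}}{\Gamma(n + 1)}, \qquad e^{\phi} = \left(\frac{\Delta_{\dot q}}{\Delta_q}\right)^{\frac12}\frac{(2\pi)^{n}(\epsilon-\epsilon_a)^{n-1}}{\Gamma(n)}.$$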

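The average value of any function of the kinetic energy,
either for the whole ensemble, or for any particular configuration,
is given by

$$\bar u = \frac{1}{\Theta^{\frac n2}\,\Gamma(\tfrac12 n)}\int_0^\infty u\,e^{-\frac{\epsilon_p}{\Theta}}\,\epsilon_p^{\frac n2 - 1}\,d\epsilon_p. \text{*} \tag{291}$$

Thus:

$$\overline{\epsilon_p^{\,m}} = \frac{\Gamma(m + \tfrac12 n)}{\Gamma(\tfrac12 n)}\,\Theta^m, \quad\text{if } m + \tfrac12 n > 0; \text{†} \tag{292}$$

$$\overline{V_p} = \frac{\Gamma(n)}{\Gamma(\tfrac12 n + 1)\,\Gamma(\tfrac12 n)}\,(2\pi\Theta)^{\frac n2}; \tag{293}$$

$$\overline{e^{\phi_p}} = \frac{\Gamma(n-1)}{\left[\Gamma(\tfrac12 n)\right]^2}\,(2\pi)^{\frac n2}\,\Theta^{\frac n2 - 1}, \quad\text{if } n > 1; \tag{294}$$

$$\overline{\frac{d\phi_p}{d\epsilon_p}} = \frac{1}{\Theta}, \quad\text{if } n > 2; \tag{295}$$

$$\overline{e^{-\phi_p}\,V_p} = \Theta. \tag{296}$$

If $n = 2$, $e^{\phi_p} = 2\pi$, and $d\phi_p/d\epsilon_p = 0$, for any value of $\epsilon_p$.

* The corresponding equation for the average value of any function of
the potential energy, when this is a quadratic function of the $q$'s, and $\Delta_{\dot q}$ is
independent of the $q$'s, is

$$\bar u = \frac{1}{\Theta^{\frac n2}\,\Gamma(\tfrac12 n)}\int_{\epsilon_a}^{\infty} u\,e^{-\frac{\epsilon_q-\epsilon_a}{\Theta}}\,(\epsilon_q-\epsilon_a)^{\frac n2 - 1}\,d\epsilon_q.$$

In the same case, the average value of any function of the (total) energy is
given by the equation

$$\bar u = \frac{1}{\Theta^{n}\,\Gamma(n)}\int_{\epsilon_a}^{\infty} u\,e^{-\frac{\epsilon-\epsilon_a}{\Theta}}\,(\epsilon-\epsilon_a)^{n-1}\,d\epsilon.$$

Hence in this case

$$\overline{(\epsilon_q-\epsilon_a)^m} = \frac{\Gamma(m + \tfrac12 n)}{\Gamma(\tfrac12 n)}\,\Theta^m, \quad\text{if } m + \tfrac12 n > 0,$$

$$\overline{(\epsilon-\epsilon_a)^m} = \frac{\Gamma(m + n)}{\Gamma(n)}\,\Theta^m, \quad\text{if } m + n > 0,$$

and

$$\overline{\frac{d\phi_q}{d\epsilon_q}} = \frac1\Theta, \quad\text{if } n > 2, \qquad \overline{\frac{d\phi}{d\epsilon}} = \frac1\Theta, \quad\text{if } n > 1.$$

If $n = 1$, $e^{\phi} = 2\pi$ and $d\phi/d\epsilon = 0$ for any value of $\epsilon$. If $n = 2$, the case is
the same with respect to $\phi_q$.

† This equation has already been proved for positive integral powers of
the kinetic energy. See page 77.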
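The averages just listed can be reproduced by direct quadrature. The following sketch is an illustration, not the author's procedure: it assumes the weight $e^{-\epsilon_p/\Theta}\epsilon_p^{n/2-1}/(\Theta^{n/2}\Gamma(\frac12 n))$ for the kinetic energy, truncates the integral at a multiple of $\Theta$ (the hypothetical parameter `cutoff`), and uses the midpoint rule so that the integrand is never evaluated at $\epsilon_p = 0$.

```python
import math

def kinetic_average(u, n, theta, steps=100_000, cutoff=60.0):
    """Midpoint-rule estimate of the ensemble average of u(eps_p)."""
    width = cutoff * theta / steps
    norm = theta ** (n / 2) * math.gamma(n / 2)
    total = 0.0
    for i in range(steps):
        eps = (i + 0.5) * width
        total += u(eps) * math.exp(-eps / theta) * eps ** (n / 2 - 1)
    return total * width / norm

def dphi_p(eps, n):
    """d(phi_p)/d(eps_p) = (n/2 - 1)/eps_p, from exp(phi_p) ~ eps_p**(n/2 - 1)."""
    return (n / 2 - 1) / eps

def V_over_exp_phi(eps, n):
    """exp(-phi_p) * V_p = 2 eps_p / n, from the general values of V_p and exp(phi_p)."""
    return 2 * eps / n
```

For $n = 6$ the three checks in the accompanying test correspond to the averages of $\epsilon_p^{\,m}$, of $d\phi_p/d\epsilon_p$, and of $e^{-\phi_p}V_p$.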
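The definitions of $V$, $V_q$ and $V_p$ give

$$V = \int\!\!\int dV_p\,dV_q, \tag{297}$$

where the integrations cover all phases for which the energy
is less than the limit $\epsilon$, for which the value of $V$ is sought.
This gives

$$V = \int_{V_q=0}^{\epsilon_q=\epsilon} V_p\,dV_q, \tag{298}$$

and

$$e^{\phi} = \int_{V_q=0}^{\epsilon_q=\epsilon} e^{\phi_p}\,dV_q, \tag{299}$$

where $V_p$ and $e^{\phi_p}$ are connected with $V_q$ by the equation

$$\epsilon_p + \epsilon_q = \text{constant} = \epsilon. \tag{300}$$

If $n > 2$, $e^{\phi_p}$ vanishes at the upper limit, i.e., for $\epsilon_p = 0$, and
we get by another differentiation

$$e^{\phi}\,\frac{d\phi}{d\epsilon} = \int_{V_q=0}^{\epsilon_q=\epsilon} e^{\phi_p}\,\frac{d\phi_p}{d\epsilon_p}\,dV_q. \tag{301}$$

We may also write

$$V = \int_{V_q=0}^{\epsilon_q=\epsilon} V_p\,e^{\phi_q}\,d\epsilon_q, \tag{302}$$

$$e^{\phi} = \int_{V_q=0}^{\epsilon_q=\epsilon} e^{\phi_p+\phi_q}\,d\epsilon_q, \tag{303}$$

etc., when $V_q$ is a continuous function of $\epsilon_q$ commencing with
the value $V_q = 0$, or when we choose to attribute to $V_q$ a
fictitious continuity commencing with the value zero, as described
on page 90.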

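If we substitute in these equations the values of $V_p$ and $e^{\phi_p}$
which we have found, we get

$$V = \frac{(2\pi)^{\frac n2}}{\Gamma(\tfrac12 n + 1)}\int_{V_q=0}^{\epsilon_q=\epsilon} (\epsilon-\epsilon_q)^{\frac n2}\,dV_q, \tag{304}$$

$$e^{\phi} = \frac{(2\pi)^{\frac n2}}{\Gamma(\tfrac12 n)}\int_{V_q=0}^{\epsilon_q=\epsilon} (\epsilon-\epsilon_q)^{\frac n2 - 1}\,dV_q, \tag{305}$$

where $e^{\phi_q}\,d\epsilon_q$ may be substituted for $dV_q$ in the cases above
described. If, therefore, $n$ is known, and $V_q$ as function of $\epsilon_q$,
$V$ and $e^{\phi}$ may be found by quadratures.

It appears from these equations that $V$ is always a continuous
increasing function of $\epsilon$, commencing with the value $V = 0$,
even when this is not the case with respect to $V_q$ and $\epsilon_q$.
The same is true of $e^{\phi}$, when $n > 2$, or when $n = 2$ if $V_q$ increases
continuously with $\epsilon_q$ from the value $V_q = 0$.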
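The remark that $V$ may be found by quadratures when $n$ and $V_q$ are known can be tried on a case with a known answer. The sketch below is an illustration under assumptions: it writes $dV_q = e^{\phi_q}d\epsilon_q$, takes the least potential energy to be zero, and uses as a hypothetical test case a quadratic potential with unit coefficients, for which $e^{\phi_q}$ has the same form as $e^{\phi_p}$ and the exact result is $V = (2\pi\epsilon)^n/\Gamma(n+1)$.

```python
import math

def V_from_quadrature(eps, n, exp_phi_q, steps=100_000):
    """Midpoint-rule evaluation of V with dV_q written as exp(phi_q) d(eps_q)."""
    width = eps / steps
    prefactor = (2 * math.pi) ** (n / 2) / math.gamma(n / 2 + 1)
    total = 0.0
    for i in range(steps):
        eps_q = (i + 0.5) * width
        total += (eps - eps_q) ** (n / 2) * exp_phi_q(eps_q)
    return prefactor * total * width

def harmonic_exp_phi_q(n):
    """Hypothetical test case: exp(phi_q) for a quadratic potential, unit coefficients."""
    return lambda eps_q: (2 * math.pi) ** (n / 2) * eps_q ** (n / 2 - 1) / math.gamma(n / 2)

def V_exact_harmonic(eps, n):
    """Closed form for the same test case."""
    return (2 * math.pi * eps) ** n / math.gamma(n + 1)
```

Any other continuous $e^{\phi_q}$ may be passed in place of the test case; only the closed form used for comparison is specific to the quadratic potential.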

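The last equation may be derived from the preceding by
differentiation with respect to $\epsilon$. Successive differentiations
give, if $h < \tfrac12 n + 1$,

$$\frac{d^hV}{d\epsilon^h} = \int_{V_q=0}^{\epsilon_q=\epsilon}\frac{d^hV_p}{d\epsilon_p^{\,h}}\,dV_q = \frac{(2\pi)^{\frac n2}}{\Gamma(\tfrac12 n + 1 - h)}\int_{V_q=0}^{\epsilon_q=\epsilon}(\epsilon-\epsilon_q)^{\frac n2 - h}\,dV_q. \tag{306}$$

$d^hV/d\epsilon^h$ is therefore positive if $h < \tfrac12 n + 1$. It is an increasing
function of $\epsilon$, if $h < \tfrac12 n$. If $\epsilon$ is not capable of
being diminished without limit, $d^hV/d\epsilon^h$ vanishes for the
least possible value of $\epsilon$, if $h < \tfrac12 n$.

If $n$ is even,

$$\frac{d^{\frac n2}V}{d\epsilon^{\frac n2}} = (2\pi)^{\frac n2}\,(V_q)_{\epsilon_q=\epsilon}. \tag{307}$$

That is, $V_q$ is the same function of $\epsilon_q$ as $\dfrac{1}{(2\pi)^{\frac n2}}\dfrac{d^{\frac n2}V}{d\epsilon^{\frac n2}}$ of $\epsilon$.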

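When $n$ is large, approximate formulae will be more available.
It will be sufficient to indicate the method proposed,
without precise discussion of the limits of its applicability or
of the degree of its approximation. For the value of $e^{\phi}$ corresponding
to any given $\epsilon$, we have

$$e^{\phi} = \int_{V_q=0}^{\epsilon_q=\epsilon} e^{\phi_p+\phi_q}\,d\epsilon_q, \tag{308}$$

where the variables are connected by the equation (300).
The maximum value of $\phi_p + \phi_q$ is therefore characterized by
the equation

$$\frac{d\phi_p}{d\epsilon_p} = \frac{d\phi_q}{d\epsilon_q}. \tag{309}$$

The values of $\epsilon_p$ and $\epsilon_q$ determined by this maximum we shall
distinguish by accents, and mark the corresponding values of
functions of $\epsilon_p$ and $\epsilon_q$ in the same way. Now we have by
Taylor's theorem

$$\phi_p = \phi_p' + \left(\frac{d\phi_p}{d\epsilon_p}\right)'(\epsilon_p-\epsilon_p') + \left(\frac{d^2\phi_p}{d\epsilon_p^2}\right)'\frac{(\epsilon_p-\epsilon_p')^2}{2} + \text{etc.}, \tag{310}$$

$$\phi_q = \phi_q' + \left(\frac{d\phi_q}{d\epsilon_q}\right)'(\epsilon_q-\epsilon_q') + \left(\frac{d^2\phi_q}{d\epsilon_q^2}\right)'\frac{(\epsilon_q-\epsilon_q')^2}{2} + \text{etc.} \tag{311}$$

If the approximation is sufficient without going beyond the
quadratic terms, since by (300)

$$\epsilon_p - \epsilon_p' = -(\epsilon_q - \epsilon_q'),$$

we may write

$$e^{\phi} = e^{\phi_p'+\phi_q'}\int_{-\infty}^{+\infty} e^{\left[\left(\frac{d^2\phi_p}{d\epsilon_p^2}\right)' + \left(\frac{d^2\phi_q}{d\epsilon_q^2}\right)'\right]\frac{(\epsilon_q-\epsilon_q')^2}{2}}\,d\epsilon_q, \tag{312}$$

where the limits have been made $\pm\infty$ for analytical simplicity.
This is allowable when the quantity in the square brackets
has a very large negative value, since the part of the integral
corresponding to other than very small values of $\epsilon_q - \epsilon_q'$ may
be regarded as a vanishing quantity.

This gives

$$e^{\phi} = e^{\phi_p'+\phi_q'}\left[\frac{-2\pi}{\left(\frac{d^2\phi_p}{d\epsilon_p^2}\right)' + \left(\frac{d^2\phi_q}{d\epsilon_q^2}\right)'}\right]^{\frac12}, \tag{313}$$

or

$$\phi = \phi_p' + \phi_q' + \tfrac12\log(2\pi) - \tfrac12\log\left[-\left(\frac{d^2\phi_p}{d\epsilon_p^2}\right)' - \left(\frac{d^2\phi_q}{d\epsilon_q^2}\right)'\right]. \tag{314}$$

From this equation, with (289), (300) and (309), we may
determine the value of $\phi$ corresponding to any given value of
$\epsilon$, when $\phi_q$ is known as function of $\epsilon_q$.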
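The quality of this approximation for large $n$ can be gauged on a case where $\phi$ is known exactly. The sketch below is an illustration under an assumption: $\phi_q$ is taken to have the same form as $\phi_p$ (a quadratic potential with unit coefficients and least potential energy zero), so that the maximum lies at $\epsilon_p' = \epsilon_q' = \frac12\epsilon$ and the exact value is $e^{\phi} = (2\pi)^n\epsilon^{n-1}/\Gamma(n)$. Logarithms of gamma functions are used to avoid overflow. The function names are hypothetical.

```python
import math

def log_c(n):
    """log of the coefficient c in exp(phi_p) = c * eps_p**(n/2 - 1)."""
    return (n / 2) * math.log(2 * math.pi) - math.lgamma(n / 2)

def phi_approx(eps, n):
    """phi from the quadratic (Gaussian) approximation, for the assumed test case."""
    a = n / 2 - 1
    half = eps / 2                              # eps_p' = eps_q' by symmetry
    phi_at_max = 2 * (log_c(n) + a * math.log(half))
    curvature_sum = -2 * a / half ** 2          # sum of the two second derivatives
    return phi_at_max + 0.5 * math.log(2 * math.pi) - 0.5 * math.log(-curvature_sum)

def phi_exact(eps, n):
    """log of (2 pi)**n * eps**(n - 1) / Gamma(n), exact for the test case."""
    return n * math.log(2 * math.pi) + (n - 1) * math.log(eps) - math.lgamma(n)
```

In this test case the error of the approximate $\phi$ falls off roughly as the reciprocal of $n$, while $\phi$ itself grows in proportion to $n$.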

Any two systems may be regarded as together forming a
third system. If we have $V$ or $\phi$ given as function of $\epsilon$ for
any two systems, we may express by quadratures $V$ and $\phi$ for
the system formed by combining the two. If we distinguish
by the suffixes $(\;)_1$, $(\;)_2$, $(\;)_{12}$ the quantities relating to
the three systems, we have easily from the definitions of these
quantities

$$V_{12} = \int\!\!\int dV_1\,dV_2 = \int V_2\,dV_1 = \int V_1\,dV_2 = \int V_1\,e^{\phi_2}\,d\epsilon_2, \tag{315}$$

$$e^{\phi_{12}} = \int e^{\phi_2}\,dV_1 = \int e^{\phi_1}\,dV_2 = \int e^{\phi_1+\phi_2}\,d\epsilon_2, \tag{316}$$

where the double integral is to be taken within the limits

$$V_1 = 0, \quad V_2 = 0, \quad\text{and}\quad \epsilon_1 + \epsilon_2 = \epsilon_{12},$$

and the variables in the single integrals are connected by the
last of these equations, while the limits are given by the first
two, which characterize the least possible values of $\epsilon_1$ and $\epsilon_2$
respectively.

It will be observed that these equations are identical in
form with those by which $V$ and $\phi$ are derived from $V_p$ or $\phi_p$
and $V_q$ or $\phi_q$, except that they do not admit in the general
case those transformations which result from substituting for
$V_p$ or $\phi_p$ the particular functions which these symbols always
represent.

Similar formulae may be used to derive $V_q$ or $\phi_q$ for the
compound system, when one of these quantities is known
as function of the potential energy in each of the systems
combined.

The operation represented by such an equation as

$$e^{\phi_{12}} = \int e^{\phi_1}\,e^{\phi_2}\,d\epsilon_1$$

is identical with one of the fundamental operations of the
theory of errors, viz., that of finding the probability of an error
from the probabilities of partial errors of which it is made up.
It admits a simple geometrical illustration.

We may take a horizontal line as an axis of abscissas, and lay
off $\epsilon_1$ as an abscissa measured to the right of any origin, and
erect $e^{\phi_1}$ as a corresponding ordinate, thus determining a certain
curve. Again, taking a different origin, we may lay off $\epsilon_2$ as
abscissas measured to the left, and determine a second curve by
erecting the ordinates $e^{\phi_2}$. We may suppose the distance between
the origins to be $\epsilon_{12}$, the second origin being to the right
if $\epsilon_{12}$ is positive. We may determine a third curve by erecting
at every point in the line (between the least values of $\epsilon_1$ and $\epsilon_2$)
an ordinate which represents the product of the two ordinates
belonging to the curves already described. The area between
this third curve and the axis of abscissas will represent the value
of $e^{\phi_{12}}$. To get the value of this quantity for varying values
of $\epsilon_{12}$, we may suppose the first two curves to be rigidly constructed,
and to be capable of being moved independently. We
may increase or diminish $\epsilon_{12}$ by moving one of these curves to
the right or left. The third curve must be constructed anew
for each different value of $\epsilon_{12}$.



CHAPTER IX.


THE FUNCTION $\phi$ AND THE CANONICAL DISTRIBUTION.

In this chapter we shall return to the consideration of the
canonical distribution, in order to investigate those properties
which are especially related to the function of the energy
which we have denoted by $\phi$.

If we denote by $N$, as usual, the total number of systems
in the ensemble,

$$N\,e^{\frac{\psi-\epsilon}{\Theta}+\phi}\,d\epsilon$$

will represent the number having energies between the limits
$\epsilon$ and $\epsilon + d\epsilon$. The expression

$$N\,e^{\frac{\psi-\epsilon}{\Theta}+\phi} \tag{317}$$

represents what may be called the density-in-energy. This
vanishes for $\epsilon = \infty$, for otherwise the necessary equation

$$\int_{V=0}^{\epsilon=\infty} e^{\frac{\psi-\epsilon}{\Theta}+\phi}\,d\epsilon = 1 \tag{318}$$

could not be fulfilled. For the same reason the density-in-energy
will vanish for $\epsilon = -\infty$, if that is a possible value of
the energy. Generally, however, the least possible value of
the energy will be a finite value, for which, if $n > 2$, $e^{\phi}$ will
vanish,* and therefore the density-in-energy. Now the density-in-energy
is necessarily positive, and since it vanishes for
extreme values of the energy if $n > 2$, it must have a maximum
in such cases, in which the energy may be said to have
its most common or most probable value, and which is
determined by the equation

$$\frac{d\phi}{d\epsilon} = \frac{1}{\Theta}. \tag{319}$$

* See page 96.


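This value of $d\phi/d\epsilon$ is also, when $n > 2$, its average value
in the ensemble. For we have identically, by integration by
parts,

$$\int_{V=0}^{\epsilon=\infty}\frac{d\phi}{d\epsilon}\,e^{\frac{\psi-\epsilon}{\Theta}+\phi}\,d\epsilon = \left[e^{\frac{\psi-\epsilon}{\Theta}+\phi}\right]_{V=0}^{\epsilon=\infty} + \frac{1}{\Theta}\int_{V=0}^{\epsilon=\infty} e^{\frac{\psi-\epsilon}{\Theta}+\phi}\,d\epsilon. \tag{320}$$

If $n > 2$, the expression in the brackets, which multiplied by $N$
would represent the density-in-energy, vanishes at the limits,
and we have by (269) and (318)

$$\overline{\frac{d\phi}{d\epsilon}} = \frac{1}{\Theta}. \tag{321}$$

It appears, therefore, that for systems of more than two degrees
of freedom, the average value of $d\phi/d\epsilon$ in an ensemble canonically
distributed is identical with the value of the same differential
coefficient as calculated for the most common energy
in the ensemble, both values being reciprocals of the modulus.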
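Both statements, that the most common energy satisfies $d\phi/d\epsilon = 1/\Theta$ and that the ensemble average of $d\phi/d\epsilon$ has the same value, can be illustrated numerically. The sketch below is an illustration under an assumption: $e^{\phi}$ is taken proportional to $\epsilon^{k-1}$ with $k > 1$ and least energy zero, a hypothetical form chosen because the normalisation is then a gamma function. The density-in-energy is taken per system ($N = 1$), the maximum is located by a grid search, and the average by the midpoint rule.

```python
import math

def density_in_energy(eps, k, theta):
    """exp((psi - eps)/theta + phi) per system, for exp(phi) proportional to eps**(k-1)."""
    return math.exp(-eps / theta + (k - 1) * math.log(eps)
                    - math.lgamma(k) - k * math.log(theta))

def dphi_deps(eps, k):
    """d(phi)/d(eps) for the assumed form of phi."""
    return (k - 1) / eps

def ensemble_average(u, k, theta, steps=100_000, cutoff=80.0):
    """Midpoint-rule average of u(eps) over the density-in-energy."""
    width = cutoff * theta / steps
    return width * sum(u((i + 0.5) * width) * density_in_energy((i + 0.5) * width, k, theta)
                       for i in range(steps))

def most_common_energy(k, theta, steps=100_000, cutoff=80.0):
    """Grid point at which the density-in-energy is greatest."""
    width = cutoff * theta / steps
    best = max(range(steps), key=lambda i: density_in_energy((i + 0.5) * width, k, theta))
    return (best + 0.5) * width
```

The grid search resolves the maximum only to within one grid spacing, which is why the comparison at the most common energy is made with a looser tolerance than the average.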

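Hitherto, in our consideration of the quantities $V$, $V_q$, $V_p$, $\phi$,
$\phi_q$, $\phi_p$, we have regarded the external coordinates as constant.
It is evident, however, from their definitions that $V$ and $\phi$ are
in general functions of the external coordinates and the energy
($\epsilon$), that $V_q$ and $\phi_q$ are in general functions of the external
coordinates and the potential energy ($\epsilon_q$). $V_p$ and $\phi_p$ we have
found to be functions of the kinetic energy ($\epsilon_p$) alone. In the
equation

$$e^{-\frac{\psi}{\Theta}} = \int_{V=0}^{\epsilon=\infty} e^{-\frac{\epsilon}{\Theta}+\phi}\,d\epsilon, \tag{322}$$

by which $\psi$ may be determined, $\Theta$ and the external coordinates
(contained implicitly in $\phi$) are constant in the integration.
The equation shows that $\psi$ is a function of these constants.
If their values are varied, we shall have by differentiation, if
$n > 2$,

$$e^{-\frac{\psi}{\Theta}}\left(-\frac{1}{\Theta}\,d\psi + \frac{\psi}{\Theta^2}\,d\Theta\right) = \frac{1}{\Theta^2}\,d\Theta\int_{V=0}^{\epsilon=\infty}\epsilon\,e^{-\frac{\epsilon}{\Theta}+\phi}\,d\epsilon + da_1\int_{V=0}^{\epsilon=\infty}\frac{d\phi}{da_1}\,e^{-\frac{\epsilon}{\Theta}+\phi}\,d\epsilon + da_2\int_{V=0}^{\epsilon=\infty}\frac{d\phi}{da_2}\,e^{-\frac{\epsilon}{\Theta}+\phi}\,d\epsilon + \text{etc.} \tag{323}$$

(Since $e^{\phi}$ vanishes with $V$, when $n > 2$, there are no terms due
to the variations of the limits.) Hence by (269)

$$-\frac{1}{\Theta}\,d\psi + \frac{\psi}{\Theta^2}\,d\Theta = \frac{\bar\epsilon}{\Theta^2}\,d\Theta + \overline{\frac{d\phi}{da_1}}\,da_1 + \overline{\frac{d\phi}{da_2}}\,da_2 + \text{etc.}, \tag{324}$$

or, since

$$\frac{\psi-\bar\epsilon}{\Theta} = \bar\eta, \tag{325}$$

$$d\psi = \bar\eta\,d\Theta - \Theta\,\overline{\frac{d\phi}{da_1}}\,da_1 - \Theta\,\overline{\frac{d\phi}{da_2}}\,da_2 - \text{etc.} \tag{326}$$

Comparing this with (112), we get

$$\overline{\frac{d\phi}{da_1}} = \frac{\bar A_1}{\Theta}, \qquad \overline{\frac{d\phi}{da_2}} = \frac{\bar A_2}{\Theta}, \qquad\text{etc.} \tag{327}$$

The first of these equations might be written*

$$\overline{\left(\frac{d\phi}{da_1}\right)_{\epsilon,a}} = \overline{\left(\frac{d\phi}{d\epsilon}\right)_{a}}\;\;\overline{\left(-\frac{d\epsilon}{da_1}\right)_{a,q}} \tag{328}$$

but must not be confounded with the equation

$$\overline{\left(\frac{d\phi}{da_1}\right)_{\epsilon,a}} = \overline{\left(\frac{d\phi}{d\epsilon}\right)_{a}\left(-\frac{d\epsilon}{da_1}\right)_{\phi,a}} \tag{329}$$

which is derived immediately from the identity

$$\left(\frac{d\phi}{da_1}\right)_{\epsilon,a} = -\left(\frac{d\phi}{d\epsilon}\right)_{a}\left(\frac{d\epsilon}{da_1}\right)_{\phi,a}. \tag{330}$$

* See equations (321) and (104). Suffixes are here added to the differential
coefficients, to make the meaning perfectly distinct, although the same quantities
may be written elsewhere without the suffixes, when it is believed that
there is no danger of misapprehension. The suffixes indicate the quantities
which are constant in the differentiation, the single letter $a$ standing for all
the letters $a_1$, $a_2$, etc., or all except the one which is explicitly varied.

Moreover, if we eliminate $d\psi$ from (326) by the equation

$$d\psi = \Theta\,d\bar\eta + \bar\eta\,d\Theta + d\bar\epsilon, \tag{331}$$

obtained by differentiating (325), we get

$$d\bar\epsilon = -\Theta\,d\bar\eta - \Theta\,\overline{\frac{d\phi}{da_1}}\,da_1 - \Theta\,\overline{\frac{d\phi}{da_2}}\,da_2 - \text{etc.}, \tag{332}$$

or by (321),

$$-d\bar\eta = \overline{\frac{d\phi}{d\epsilon}}\,d\bar\epsilon + \overline{\frac{d\phi}{da_1}}\,da_1 + \overline{\frac{d\phi}{da_2}}\,da_2 + \text{etc.} \tag{333}$$

Except for the signs of average, the second member of this
equation is the same as that of the identity

$$d\phi = \frac{d\phi}{d\epsilon}\,d\epsilon + \frac{d\phi}{da_1}\,da_1 + \frac{d\phi}{da_2}\,da_2 + \text{etc.} \tag{334}$$

For the more precise comparison of these equations, we may
suppose that the energy in the last equation is some definite
and fairly representative energy in the ensemble. For this
purpose we might choose the average energy. It will perhaps
be more convenient to choose the most common energy,
which we shall denote by $\epsilon_0$. The same suffix will be applied
to functions of the energy determined for this value. Our
identity then becomes

$$d\phi_0 = \left(\frac{d\phi}{d\epsilon}\right)_0 d\epsilon_0 + \left(\frac{d\phi}{da_1}\right)_0 da_1 + \left(\frac{d\phi}{da_2}\right)_0 da_2 + \text{etc.} \tag{335}$$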
It has been shown that

$$\overline{\frac{d\phi}{d\epsilon}} = \left(\frac{d\phi}{d\epsilon}\right)_0 = \frac{1}{\Theta}, \tag{336}$$

when $n > 2$. Moreover, since the external coordinates have
constant values throughout the ensemble, the values of
$d\phi/da_1$, $d\phi/da_2$, etc. vary in the ensemble only on account
of the variations of the energy ($\epsilon$), which, as we have seen,
may be regarded as sensibly constant throughout the ensemble,
when $n$ is very great. In this case, therefore, we may
regard the average values

$$\overline{\frac{d\phi}{da_1}}, \qquad \overline{\frac{d\phi}{da_2}}, \qquad\text{etc.,}$$

as practically equivalent to the values relating to the most
common energy

$$\left(\frac{d\phi}{da_1}\right)_0, \qquad \left(\frac{d\phi}{da_2}\right)_0, \qquad\text{etc.}$$

In this case also $d\bar\epsilon$ is practically equivalent to $d\epsilon_0$. We have
therefore, for very large values of $n$,

$$-d\bar\eta = d\phi_0 \tag{337}$$

approximately. That is, except for an additive constant, $-\bar\eta$
may be regarded as practically equivalent to $\phi_0$, when the
number of degrees of freedom of the system is very great.
It is not meant by this that the variable part of $\bar\eta + \phi_0$ is
numerically of a lower order of magnitude than unity. For
when $n$ is very great, $-\bar\eta$ and $\phi_0$ are very great, and we can
only conclude that the variable part of $\bar\eta + \phi_0$ is insignificant
compared with the variable part of $\bar\eta$ or of $\phi_0$, taken
separately.

Now we have already noticed a certain correspondence
between the quantities $\Theta$ and $\bar\eta$ and those which in thermodynamics
are called temperature and entropy. The property
just demonstrated, with those expressed by equation (336),
therefore suggests that the quantities $\phi$ and $d\epsilon/d\phi$ may also
correspond to the thermodynamic notions of entropy and temperature.
We leave the discussion of this point to a subsequent
chapter, and only mention it here to justify the
somewhat detailed investigation of the relations of these
quantities.

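We may get a clearer view of the limiting form of the
relations when the number of degrees of freedom is indefinitely
increased, if we expand the function $\phi$ in a series
arranged according to ascending powers of $\epsilon - \epsilon_0$. This expansion
may be written

$$\phi = \phi_0 + \left(\frac{d\phi}{d\epsilon}\right)_0(\epsilon-\epsilon_0) + \left(\frac{d^2\phi}{d\epsilon^2}\right)_0\frac{(\epsilon-\epsilon_0)^2}{2} + \left(\frac{d^3\phi}{d\epsilon^3}\right)_0\frac{(\epsilon-\epsilon_0)^3}{3!} + \text{etc.} \tag{338}$$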
Adding the identical equation 
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^ -- €o ^ e -- e-Q 

^ ~ (H) 0 > 


we get by (336) 


0 


+ ^ = *^ + 4^ + 


Substituting this value in 


— - €o)® 

WeV 2 '^[dey 


+ etc. 
(339) 


t" if- — «• 



+ <f> 


cZe, 


which expresses the prol)abiiity that the energy of an unspeci¬ 
fied system of the ensemble lies between the limits e' and e", 
we get 




4-00 C 


<7yfO„ 2 |3' 


•+ etc. 


de. 


(340) 


When the number of degrees of freedom is very great, and 
e — c,j in consequence very small, we may neglect the higlier 
powers and wnite"*^ 



c 


This shows that for a very great number of degrees of 
freedom the probability of deviations of energy from the most 
probable value approaches the form expressed by the 

‘ huv of errors.’ With this approximate law^, we get 


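$$e^{\frac{\psi-\epsilon_0}{\Theta}+\phi_0} \left(\frac{-2\pi}{\left(\frac{d^2\phi}{d\epsilon^2}\right)_0}\right)^{\frac12} = 1, \tag{342}$$

$$\left(\frac{d^2\phi}{d\epsilon^2}\right)_0 \overline{(\epsilon-\epsilon_0)^2} = -1, \tag{343}$$

whence

$$\frac{\psi-\epsilon_0}{\Theta} + \phi_0 = \tfrac12 \log\left(\frac{-1}{2\pi}\left(\frac{d^2\phi}{d\epsilon^2}\right)_0\right) = -\tfrac12 \log\left(2\pi\,\overline{(\epsilon-\epsilon_0)^2}\right). \tag{344}$$

* If a higher degree of accuracy is desired than is afforded by formula (341),
it may be multiplied by the series obtained from

$$e^{\left(\frac{d^3\phi}{d\epsilon^3}\right)_0 \frac{(\epsilon-\epsilon_0)^3}{3!} + \text{etc.}}$$

by the ordinary formula for the expansion in series of an exponential
function. There would be no especial analytical difficulty in taking account of
a moderate number of terms of such a series, which would commence

$$1 + \left(\frac{d^3\phi}{d\epsilon^3}\right)_0 \frac{(\epsilon-\epsilon_0)^3}{3!} + \left(\frac{d^4\phi}{d\epsilon^4}\right)_0 \frac{(\epsilon-\epsilon_0)^4}{4!} + \text{etc.}$$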

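Now it has been proved in Chapter VII that

$$\overline{(\epsilon-\bar\epsilon)^2} = \frac{2}{n}\,\frac{d\bar\epsilon}{d\bar\epsilon_p}\,\bar\epsilon_p^{\,2}.$$

We have therefore

$$\eta_0 + \phi_0 = \bar\eta + \phi_0 = -\tfrac12 \log\left(2\pi\,\overline{(\epsilon-\bar\epsilon)^2}\right) = -\tfrac12 \log\left(\frac{4\pi}{n}\,\frac{d\bar\epsilon}{d\bar\epsilon_p}\,\bar\epsilon_p^{\,2}\right) \tag{345}$$

approximately. The order of magnitude of $\bar\eta + \phi_0$ is therefore
that of $\log n$. This magnitude is mainly constant.
The order of magnitude of $\bar\eta + \phi_0 - \tfrac12 \log n$ is that of unity.
The order of magnitude of $\phi_0$, and therefore of $-\bar\eta$, is that
of $n$.*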
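The approach to the 'law of errors' can be checked numerically. The sketch below is only an illustration under an assumption that is not part of the text: it takes $e^{\phi}$ proportional to $\epsilon^{n/2-1}$ (a system with kinetic energy only), for which the exact distribution in energy is known in closed form. The function name `gaussian_check` and its arguments are hypothetical.

```python
import math

def gaussian_check(n, theta=1.0):
    """Compare the Gaussian ('law of errors') form with the exact energy
    distribution, ASSUMING e^phi = C * eps**(n/2 - 1); requires n > 2."""
    a = n / 2.0                      # exponent of the assumed structure function
    eps0 = (a - 1.0) * theta         # most probable energy, from (d phi/d eps)_0 = 1/theta
    d2phi = -(a - 1.0) / eps0**2     # (d^2 phi / d eps^2)_0 for the assumed phi
    var_gauss = -1.0 / d2phi         # mean square deviation given by the Gaussian form
    var_exact = a * theta**2         # exact mean square deviation of the energy
    # logarithm of the exact probability density e^{(psi - eps)/theta + phi} at eps0
    log_p0 = ((a - 1.0) * math.log(eps0) - eps0 / theta
              - math.lgamma(a) - a * math.log(theta))
    # normalisation of the Gaussian form; equals 1 when the form is exact
    norm = math.exp(log_p0) * math.sqrt(2.0 * math.pi * var_gauss)
    return norm, var_gauss / var_exact

if __name__ == "__main__":
    for n in (10, 100, 2000):
        print(n, gaussian_check(n))
```

Under this assumption the normalisation is about 0.98 for $n = 10$ and approaches unity as $n$ grows, while the ratio of the approximate to the exact mean square deviation is $1 - 2/n$.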


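Equation (338) gives for the first approximation

$$\overline{\phi - \phi_0} = \tfrac12 \left(\frac{d^2\phi}{d\epsilon^2}\right)_0 \overline{(\epsilon-\epsilon_0)^2} = -\tfrac12, \tag{346}$$

$$\overline{(\phi-\phi_0)(\epsilon-\epsilon_0)} = \frac{\overline{(\epsilon-\epsilon_0)^2}}{\Theta} = \bar\epsilon_p\,\frac{d\bar\epsilon}{d\bar\epsilon_p}, \tag{347}$$

$$\overline{(\phi-\phi_0)^2} = \frac{\overline{(\epsilon-\epsilon_0)^2}}{\Theta^2} = \frac{n}{2}\,\frac{d\bar\epsilon}{d\bar\epsilon_p}. \tag{348}$$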


The members of the last equation have the order of magnitude 
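of $n$. Equation (338) gives also for the first approximation

$$\frac{d\phi}{d\epsilon} - \frac{1}{\Theta} = \left(\frac{d^2\phi}{d\epsilon^2}\right)_0 (\epsilon-\epsilon_0),$$

* Compare (289), (314).

whence

$$\overline{\left(\frac{d\phi}{d\epsilon} - \frac1\Theta\right)(\epsilon-\epsilon_0)} = \left(\frac{d^2\phi}{d\epsilon^2}\right)_0 \overline{(\epsilon-\epsilon_0)^2} = -1, \tag{349}$$

$$\overline{\left(\frac{d\phi}{d\epsilon} - \frac1\Theta\right)^2} = \left(\frac{d^2\phi}{d\epsilon^2}\right)_0^2 \overline{(\epsilon-\epsilon_0)^2} = -\left(\frac{d^2\phi}{d\epsilon^2}\right)_0. \tag{350}$$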

This is of the order of magnitude of n* 

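* We shall find hereafter that the equation

$$\overline{\left(\frac{d\phi}{d\epsilon} - \frac1\Theta\right)(\epsilon-\bar\epsilon)} = -1$$

is exact for any value of $n$ greater than 2, and that the equation

$$\overline{\left(\frac{d\phi}{d\epsilon} - \frac1\Theta\right)^2} = -\overline{\frac{d^2\phi}{d\epsilon^2}}$$

is exact for any value of $n$ greater than 4.

It should be observed that the approximate distribution of
the ensemble in energy according to the 'law of errors' is
not dependent on the particular form of the function of the
energy which we have assumed for the index of probability
($\eta$). In any case, we must have

$$\int_{V=0}^{\epsilon=\infty} e^{\eta+\phi}\,d\epsilon = 1, \tag{351}$$

where $e^{\eta+\phi}$ is necessarily positive. This requires that it
shall vanish for $\epsilon = \infty$, and also for $\epsilon = -\infty$, if this is a
possible value. It has been shown in the last chapter that if $\epsilon$ has
a (finite) least possible value (which is the usual case) and
$n > 2$, $e^{\phi}$ will vanish for that least value of $\epsilon$. In general
therefore $\eta + \phi$ will have a maximum, which determines the
most probable value of the energy. If we denote this value
by $\epsilon_0$, and distinguish the corresponding values of the
functions of the energy by the same suffix, we shall have

$$\left(\frac{d\eta}{d\epsilon}\right)_0 + \left(\frac{d\phi}{d\epsilon}\right)_0 = 0. \tag{352}$$

The probability that an unspecified system of the ensemble
falls within any given limits of energy ($\epsilon'$ and $\epsilon''$) is
represented by

$$\int_{\epsilon'}^{\epsilon''} e^{\eta+\phi}\,d\epsilon.$$

If we expand $\eta$ and $\phi$ in ascending powers of $\epsilon - \epsilon_0$, without
going beyond the squares, the probability that the energy falls
within the given limits takes the form of the 'law of errors':

$$e^{\eta_0+\phi_0} \int_{\epsilon'}^{\epsilon''} e^{\left[\left(\frac{d^2\eta}{d\epsilon^2}\right)_0 + \left(\frac{d^2\phi}{d\epsilon^2}\right)_0\right] \frac{(\epsilon-\epsilon_0)^2}{2}}\,d\epsilon. \tag{353}$$

This gives

$$\eta_0 + \phi_0 = \tfrac12 \log\left[\frac{-1}{2\pi}\left(\frac{d^2\eta}{d\epsilon^2}\right)_0 - \frac{1}{2\pi}\left(\frac{d^2\phi}{d\epsilon^2}\right)_0\right], \tag{354}$$

and

$$\overline{(\epsilon-\epsilon_0)^2} = \left[-\left(\frac{d^2\eta}{d\epsilon^2}\right)_0 - \left(\frac{d^2\phi}{d\epsilon^2}\right)_0\right]^{-1}. \tag{355}$$

We shall have a close approximation in general when the
quantities equated in (355) are very small, i. e., when

$$-\left(\frac{d^2\eta}{d\epsilon^2}\right)_0 - \left(\frac{d^2\phi}{d\epsilon^2}\right)_0 \tag{356}$$

is very great. Now when $n$ is very great, $-d^2\phi/d\epsilon^2$ is of the
same order of magnitude, and the condition that (356) shall
be very great does not restrict very much the nature of the
function $\eta$.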

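We may obtain other properties pertaining to average values
in a canonical ensemble by the method used for the average of
$d\phi/d\epsilon$. Let $u$ be any function of the energy, either alone or
with $\Theta$ and the external coordinates. The average value of $u$
in the ensemble is determined by the equation

$$\bar u = \int_{V=0}^{\epsilon=\infty} u\,e^{\frac{\psi-\epsilon}{\Theta}+\phi}\,d\epsilon. \tag{357}$$

Now we have identically

$$\int_{V=0}^{\epsilon=\infty} \left(\frac{du}{d\epsilon} - \frac{u}{\Theta} + u\frac{d\phi}{d\epsilon}\right) e^{\frac{\psi-\epsilon}{\Theta}+\phi}\,d\epsilon = \left[u\,e^{\frac{\psi-\epsilon}{\Theta}+\phi}\right]_{V=0}^{\epsilon=\infty}. \tag{358}$$

Therefore, by the preceding equation

$$\overline{\frac{du}{d\epsilon}} - \frac{\bar u}{\Theta} + \overline{u\frac{d\phi}{d\epsilon}} = \left[u\,e^{\frac{\psi-\epsilon}{\Theta}+\phi}\right]_{V=0}^{\epsilon=\infty}.* \tag{359}$$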


If we set $u = 1$, (a value which need not be excluded,) the
second member of this equation vanishes, as shown on page
101, if $n > 2$, and we get

$$\overline{\frac{d\phi}{d\epsilon}} = \frac{1}{\Theta}, \tag{360}$$

as before. It is evident from the same considerations that the
second member of (359) will always vanish if $n > 2$, unless $u$
becomes infinite at one of the limits, in which case a more careful
examination of the value of the expression will be necessary.
To facilitate the discussion of such cases, it will be convenient
to introduce a certain limitation in regard to the nature of the
system considered. We have necessarily supposed, in all our
treatment of systems canonically distributed, that the system
considered was such as to be capable of the canonical distribution
with the given value of the modulus. We shall now
suppose that the system is such as to be capable of a canonical
distribution with any (finite)† modulus. Let us see what
cases we exclude by this last limitation.

* A more general equation, which is not limited to ensembles canonically
distributed, is

$$\overline{\frac{du}{d\epsilon}} + \overline{u\frac{d\eta}{d\epsilon}} + \overline{u\frac{d\phi}{d\epsilon}} = \left[u\,e^{\eta+\phi}\right]_{V=0}^{\epsilon=\infty},$$

where $\eta$ denotes, as usual, the index of probability of phase.

† The term finite applied to the modulus is intended to exclude the value
zero as well as infinity.

The impossibility of a canonical distribution occurs when
the equation

$$e^{-\frac{\psi}{\Theta}} = \int_{V=0}^{\epsilon=\infty} e^{-\frac{\epsilon}{\Theta}+\phi}\,d\epsilon \tag{361}$$


fails to determine a finite value for $\psi$. Evidently the equation
cannot make $\psi$ an infinite positive quantity; the impossibility
therefore occurs when the equation makes $\psi = -\infty$. Now
we get easily from (191)

$$d\left(\frac{\psi}{\Theta}\right) = -\frac{\bar\epsilon}{\Theta^2}\,d\Theta.$$

If the canonical distribution is possible for any values of $\Theta$,
we can apply this equation so long as the canonical distribution
is possible. The equation shows that as $\Theta$ is increased
(without becoming infinite) $-\psi$ cannot become infinite unless
$\bar\epsilon$ simultaneously becomes infinite, and that as $\Theta$ is decreased
(without becoming zero) $-\psi$ cannot become infinite unless
simultaneously $\bar\epsilon$ becomes an infinite negative quantity. The
corresponding cases in thermodynamics would be bodies which
could absorb or give out an infinite amount of heat without
passing certain limits of temperature, when no external work
is done in the positive or negative sense. Such infinite values
present no analytical difficulties, and do not contradict the
general laws of mechanics or of thermodynamics, but they
are quite foreign to our ordinary experience of nature. In
excluding such cases (which are certainly not entirely devoid
of interest) we do not exclude any which are analogous to
any actual cases in thermodynamics.

We assume then that for any finite value of $\Theta$ the second
member of (361) has a finite value.

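When this condition is fulfilled, the second member of
(359) will vanish for $u = e^{-\phi}V$. For, if we set $\Theta' = 2\Theta$,

$$e^{-\frac{\epsilon}{\Theta}}\,V \le e^{-\frac{\epsilon}{\Theta'}} \int_{V=0}^{\epsilon} e^{-\frac{\epsilon}{\Theta'}}\,dV \le e^{-\frac{\epsilon}{\Theta'}}\,e^{-\frac{\psi'}{\Theta'}},$$

where $\psi'$ denotes the value of $\psi$ for the modulus $\Theta'$. Since
the last member of this formula vanishes for $\epsilon = \infty$, the
less value represented by the first member must also vanish
for the same value of $\epsilon$. Therefore the second member of
(359), which differs only by a constant factor, vanishes at
the upper limit. The case of the lower limit remains to be
considered. Now

$$e^{-\frac{\epsilon}{\Theta}}\,V \le \int_{V=0}^{\epsilon} e^{-\frac{\epsilon}{\Theta}}\,dV.$$

The second member of this formula evidently vanishes for
the value of $\epsilon$ which gives $V = 0$, whether this be finite or
negative infinity. Therefore, the second member of (359)
vanishes at the lower limit also, and we have

$$\overline{1 - e^{-\phi}V\frac{d\phi}{d\epsilon}} - \frac{\overline{e^{-\phi}V}}{\Theta} + \overline{e^{-\phi}V\frac{d\phi}{d\epsilon}} = 0,$$

or

$$\overline{e^{-\phi}V} = \Theta. \tag{362}$$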


This equation, which is subject to no restriction in regard to
the value of $n$, suggests a connection or analogy between the
function of the energy of a system which is represented by
$e^{-\phi}V$ and the notion of temperature in thermodynamics. We
shall return to this subject in Chapter XIV.
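Equations (360) and (362) can be verified in closed form for a simple case. The following is a sketch under an assumption that does not come from the text: $V = C\epsilon^{n/2}/(n/2)$, so that $e^{\phi} = C\epsilon^{n/2-1}$, $e^{-\phi}V = 2\epsilon/n$ and $d\phi/d\epsilon = (n/2-1)/\epsilon$. The canonical distribution in energy is then a gamma law whose moments are known exactly. The helper names are hypothetical.

```python
import math

def gamma_moment(a, theta, k):
    """Average of eps**k over the density eps**(a-1) * exp(-eps/theta)
    / (Gamma(a) * theta**a); finite only when a + k > 0."""
    return theta**k * math.exp(math.lgamma(a + k) - math.lgamma(a))

def canonical_averages(n, theta):
    """Canonical averages of e^{-phi} V and of d(phi)/d(eps) for the
    ASSUMED structure function e^phi = C * eps**(n/2 - 1); requires n > 2."""
    a = n / 2.0
    avg_e_minus_phi_v = gamma_moment(a, theta, 1) / a       # e^{-phi} V = eps / a
    avg_dphi_deps = (a - 1.0) * gamma_moment(a, theta, -1)  # d phi/d eps = (a - 1)/eps
    return avg_e_minus_phi_v, avg_dphi_deps

if __name__ == "__main__":
    print(canonical_averages(5, 0.7))   # compare with theta and 1/theta
```

For any $n > 2$ the two returned values agree with $\Theta$ and $1/\Theta$ to rounding error. For $n = 2$ the second average is not defined in this sketch, which mirrors the condition $n > 2$ attached to (360).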

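If $n > 2$, the second member of (359) may easily be shown
to vanish for any of the following values of $u$, viz.: $\phi$, $e^{\phi}$, $\epsilon$,
$\epsilon^m$, where $m$ denotes any positive number. It will also
vanish, when $n > 4$, for $u = d\phi/d\epsilon$, and when $n > 2h$ for
$u = e^{-\phi}\,d^hV/d\epsilon^h$. When the second member of (359) vanishes,
and $n > 2$, we may write

$$\overline{(u-\bar u)\left(\frac{d\phi}{d\epsilon} - \frac1\Theta\right)} = \overline{u\frac{d\phi}{d\epsilon}} - \frac{\bar u}{\Theta} = -\overline{\frac{du}{d\epsilon}}. \tag{363}$$

We thus obtain the following equations:
If $n > 2$,

$$\overline{(\phi-\bar\phi)\left(\frac{d\phi}{d\epsilon} - \frac1\Theta\right)} = \overline{\phi\frac{d\phi}{d\epsilon}} - \frac{\bar\phi}{\Theta} = -\frac{1}{\Theta}, \tag{364}$$

$$\overline{\left(e^{\phi} - \overline{e^{\phi}}\right)\left(\frac{d\phi}{d\epsilon} - \frac1\Theta\right)} = \overline{e^{\phi}\frac{d\phi}{d\epsilon}} - \frac{\overline{e^{\phi}}}{\Theta} = -\overline{e^{\phi}\frac{d\phi}{d\epsilon}}, \tag{365}$$

or

$$2\,\overline{e^{\phi}\frac{d\phi}{d\epsilon}} = \frac{\overline{e^{\phi}}}{\Theta}, \tag{366}$$

$$\overline{(\epsilon-\bar\epsilon)\left(\frac{d\phi}{d\epsilon} - \frac1\Theta\right)} = \overline{\epsilon\frac{d\phi}{d\epsilon}} - \frac{\bar\epsilon}{\Theta} = -1,* \tag{367}$$

$$\overline{\left(\epsilon^m - \overline{\epsilon^m}\right)\left(\frac{d\phi}{d\epsilon} - \frac1\Theta\right)} = \overline{\epsilon^m\frac{d\phi}{d\epsilon}} - \frac{\overline{\epsilon^m}}{\Theta} = -m\,\overline{\epsilon^{m-1}}. \tag{368}$$

If $n > 4$,

$$\overline{\left(\frac{d\phi}{d\epsilon} - \frac1\Theta\right)^2} = \overline{\left(\frac{d\phi}{d\epsilon}\right)^2} - \frac{1}{\Theta^2} = -\overline{\frac{d^2\phi}{d\epsilon^2}}.† \tag{369}$$

If $n > 2h$,

$$\overline{e^{-\phi}\frac{d^hV}{d\epsilon^h}\frac{d\phi}{d\epsilon}} - \frac1\Theta\,\overline{e^{-\phi}\frac{d^hV}{d\epsilon^h}} = \overline{e^{-\phi}\frac{d^hV}{d\epsilon^h}\frac{d\phi}{d\epsilon}} - \overline{e^{-\phi}\frac{d^{h+1}V}{d\epsilon^{h+1}}},$$

or

$$\overline{e^{-\phi}\frac{d^{h+1}V}{d\epsilon^{h+1}}} = \frac1\Theta\,\overline{e^{-\phi}\frac{d^hV}{d\epsilon^h}}, \tag{370}$$

whence

$$\overline{e^{-\phi}\frac{d^{h+1}V}{d\epsilon^{h+1}}} = \frac{1}{\Theta^h}. \tag{371}$$

Giving $h$ the values 1, 2, 3, etc., we have

$$\overline{\frac{d\phi}{d\epsilon}} = \frac1\Theta, \quad \text{if } n > 2,$$

$$\overline{\frac{d^2\phi}{d\epsilon^2} + \left(\frac{d\phi}{d\epsilon}\right)^2} = \frac{1}{\Theta^2}, \quad \text{if } n > 4,$$

as already obtained. Also

$$\overline{\frac{d^3\phi}{d\epsilon^3} + 3\frac{d^2\phi}{d\epsilon^2}\frac{d\phi}{d\epsilon} + \left(\frac{d\phi}{d\epsilon}\right)^3} = \frac{1}{\Theta^3}, \quad \text{if } n > 6. \tag{372}$$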


* This equation may also be obtained from equations (262) and (321).
Compare also equation (349), which was derived by an approximative method.
† Compare equation (350), obtained by an approximative method.

If $V_q$ is a continuous increasing function of $\epsilon_q$, commencing
with $V_q = 0$, the average value in a canonical ensemble of any
function of $\epsilon_q$, either alone or with the modulus and the external
coordinates, is given by equation (275), which is identical
with (357) except that $\epsilon$, $\phi$, and $\psi$ have the suffix $(\ )_q$. The
equation may be transformed so as to give an equation identical
with (359) except for the suffixes. If we add the same
suffixes to equation (361), the finite value of its members will
determine the possibility of the canonical distribution.

From these data, it is easy to derive equations similar to
(360), (362)-(372), except that the conditions of their validity
must be differently stated. The equation

$$\overline{e^{-\phi_q}V_q} = \Theta$$

requires only the condition already mentioned with respect to
$V_q$. This equation corresponds to (362), which is subject to
no restriction with respect to the value of $n$. We may observe,
however, that $V$ will always satisfy a condition similar
to that mentioned with respect to $V_q$.

If $V_q$ satisfies the condition mentioned, and $e^{\phi_q}$ a similar
condition, i. e., if $e^{\phi_q}$ is a continuous increasing function of
$\epsilon_q$, commencing with the value $e^{\phi_q} = 0$, equations will hold
similar to those given for the case when $n > 2$, viz., similar to
(360), (364)-(368). Especially important is

$$\overline{\frac{d\phi_q}{d\epsilon_q}} = \frac{1}{\Theta}.$$

If $V_q$, $e^{\phi_q}$ (or $dV_q/d\epsilon_q$), $d^2V_q/d\epsilon_q^2$ all satisfy similar conditions,
we shall have an equation similar to (369), which was subject
to the condition $n > 4$. And if $d^3V_q/d\epsilon_q^3$ also satisfies a
similar condition, we shall have an equation similar to (372),
for which the condition was $n > 6$. Finally, if $V_q$ and $h$
successive differential coefficients satisfy conditions of the kind
mentioned, we shall have equations like (370) and (371) for
which the condition was $n > 2h$.

These conditions take the place of those given above relating
to $n$. In fact, we might give conditions relating to the
differential coefficients of $V$, similar to those given relating to
the differential coefficients of $V_q$, instead of the conditions
relating to $n$, for the validity of equations (360), (363)-(372).
This would somewhat extend the application of the equations.



CHAPTER X.


ON A DISTRIBUTION IN PHASE CALLED MICROCANONICAL,
IN WHICH ALL THE SYSTEMS HAVE
THE SAME ENERGY.

An important case of statistical equilibrium is that in which
all systems of the ensemble have the same energy. We may
arrive at the notion of a distribution which will satisfy the
necessary conditions by the following process. We may
suppose that an ensemble is distributed with a uniform
density-in-phase between two limiting values of the energy, $\epsilon'$ and
$\epsilon''$, and with density zero outside of those limits. Such an
ensemble is evidently in statistical equilibrium according to
the criterion in Chapter IV, since the density-in-phase may be
regarded as a function of the energy. By diminishing the
difference of $\epsilon'$ and $\epsilon''$, we may diminish the differences of
energy in the ensemble. The limit of this process gives us
a permanent distribution in which the energy is constant.

We should arrive at the same result, if we should make the
density any function of the energy between the limits $\epsilon'$ and
$\epsilon''$, and zero outside of those limits. Thus, the limiting
distribution obtained from the part of a canonical ensemble
between two limits of energy, when the difference of the
limiting energies is indefinitely diminished, is independent of
the modulus, being determined entirely by the energy, and
is identical with the limiting distribution obtained from a
uniform density between limits of energy approaching the
same value.

We shall call the limiting distribution at which we arrive
by this process microcanonical.

We shall find however, in certain cases, that for certain
values of the energy, viz., for those for which $e^{\phi}$ is infinite,
this process fails to define a limiting distribution in any such
distinct sense as for other values of the energy. The difficulty
is not in the process, but in the nature of the case, being
entirely analogous to that which we meet when we try to find
a canonical distribution in cases when $\psi$ becomes infinite.
We have not regarded such cases as affording true examples
of the canonical distribution, and we shall not regard the cases
in which $e^{\phi}$ is infinite as affording true examples of the
microcanonical distribution. We shall in fact find as we go on that
in such cases our most important formulae become illusory.

The use of formulae relating to a canonical ensemble which
contain $e^{\phi}\,d\epsilon$ instead of $dp_1 \ldots dq_n$, as in the preceding chapters,
amounts to the consideration of the ensemble as divided into
an infinity of microcanonical elements.

From a certain point of view, the microcanonical distribution
may seem more simple than the canonical, and it has perhaps
been more studied, and been regarded as more closely related
to the fundamental notions of thermodynamics. To this last
point we shall return in a subsequent chapter. It is sufficient
here to remark that analytically the canonical distribution is
much more manageable than the microcanonical.

We may sometimes avoid difficulties which the microcanonical
distribution presents by regarding it as the result of the
following process, which involves conceptions less simple but
more amenable to analytical treatment. We may suppose an
ensemble distributed with a density proportional to

$$e^{-\frac{(\epsilon-\epsilon')^2}{\omega^2}},$$

where $\omega$ and $\epsilon'$ are constants, and then diminish indefinitely
the value of the constant $\omega$. Here the density is nowhere
zero until we come to the limit, but at the limit it is zero for
all energies except $\epsilon'$. We thus avoid the analytical
complication of discontinuities in the value of the density, which
require the use of integrals with inconvenient limits.
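The limiting process just described can be illustrated numerically. The sketch below assumes, purely for illustration, a structure function $e^{\phi}$ proportional to $\epsilon^{a-1}$, so that the distribution in energy is proportional to $\epsilon^{a-1} e^{-(\epsilon-\epsilon')^2/\omega^2}$. The function name, the exponent `a` and the quadrature settings are hypothetical choices, not taken from the text.

```python
import math

def energy_stats(omega, eps_prime=1.0, a=3.0, points=4001):
    """Mean and standard deviation of the energy for a density-in-phase
    proportional to exp(-(eps - eps')**2 / omega**2), with the ASSUMED
    structure function e^phi proportional to eps**(a - 1)."""
    lo = max(0.0, eps_prime - 8.0 * omega)   # energies below zero are excluded
    hi = eps_prime + 8.0 * omega             # the weight is negligible beyond this
    h = (hi - lo) / points
    s0 = s1 = s2 = 0.0
    for i in range(points):
        eps = lo + (i + 0.5) * h             # midpoint rule
        w = eps**(a - 1.0) * math.exp(-((eps - eps_prime) / omega) ** 2)
        s0 += w
        s1 += w * eps
        s2 += w * eps * eps
    mean = s1 / s0
    return mean, math.sqrt(s2 / s0 - mean * mean)

if __name__ == "__main__":
    for omega in (0.5, 0.1, 0.01):
        print(omega, energy_stats(omega))
```

As $\omega$ is diminished the mean energy approaches $\epsilon'$ and the spread falls in proportion to $\omega$ (close to $\omega/\sqrt2$ once $\omega$ is small), so that the ensemble tends to one in which all systems have the energy $\epsilon'$.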

In a microcanonical ensemble of systems the energy ($\epsilon$) is
constant, but the kinetic energy ($\epsilon_p$) and the potential energy
($\epsilon_q$) vary in the different systems, subject of course to the
condition

$$\epsilon_p + \epsilon_q = \epsilon = \text{constant}. \tag{373}$$

Our first inquiries will relate to the division of energy into
these two parts, and to the average values of functions of $\epsilon_p$
and $\epsilon_q$.

We shall use the notation $\overline{u}|_{\epsilon}$ to denote an average value in
a microcanonical ensemble of energy $\epsilon$. An average value
in a canonical ensemble of modulus $\Theta$, which has hitherto
been denoted by $\bar u$, we shall in this chapter denote by $\overline{u}|_{\Theta}$, to
distinguish more clearly the two kinds of averages.

The extension-in-phase within any limits which can be given
in terms of $\epsilon_p$ and $\epsilon_q$ may be expressed in the notations of the
preceding chapter by the double integral

$$\iint dV_p\,dV_q$$

taken within those limits. If an ensemble of systems is
distributed within those limits with a uniform density-in-phase,
the average value in the ensemble of any function ($u$) of the
kinetic and potential energies will be expressed by the quotient
of integrals

$$\frac{\iint u\,dV_p\,dV_q}{\iint dV_p\,dV_q}.$$

Since $dV_p = e^{\phi_p}\,d\epsilon_p$, and $d\epsilon_p = d\epsilon$ when $\epsilon_q$ is constant, the
expression may be written

$$\frac{\iint u\,e^{\phi_p}\,d\epsilon\,dV_q}{\iint e^{\phi_p}\,d\epsilon\,dV_q}.$$

To get the average value of $u$ in an ensemble distributed
microcanonically with the energy $\epsilon$, we must make the
integrations cover the extension-in-phase between the energies
$\epsilon$ and $\epsilon + d\epsilon$. This gives

$$\overline{u}|_{\epsilon} = \frac{d\epsilon \int_{V_q=0}^{\epsilon_q=\epsilon} u\,e^{\phi_p}\,dV_q}{d\epsilon \int_{V_q=0}^{\epsilon_q=\epsilon} e^{\phi_p}\,dV_q}.$$

But by (299) the value of the integral in the denominator
is $e^{\phi}$. We have therefore

$$\overline{u}|_{\epsilon} = e^{-\phi} \int_{V_q=0}^{\epsilon_q=\epsilon} u\,e^{\phi_p}\,dV_q, \tag{374}$$

where $e^{\phi_p}$ and $V_q$ are connected by equation (373), and $u$, if
given as function of $\epsilon_p$, or of $\epsilon_p$ and $\epsilon_q$, becomes in virtue of
the same equation a function of $\epsilon_q$ alone.

We shall assume that $e^{\phi}$ has a finite value. If $n > 1$, it is
evident from equation (305) that $e^{\phi}$ is an increasing function
of $\epsilon$, and therefore cannot be infinite for one value of $\epsilon$ without
being infinite for all greater values of $\epsilon$, which would make
$-\psi$ infinite.* When $n > 1$, therefore, if we assume that $e^{\phi}$
is finite, we only exclude such cases as we found necessary
to exclude in the study of the canonical distribution. But
when $n = 1$, cases may occur in which the canonical distribution
is perfectly applicable, but in which the formulae for the
microcanonical distribution become illusory, for particular
values of $\epsilon$, on account of the infinite value of $e^{\phi}$. Such failing
cases of the microcanonical distribution for particular values
of the energy will not prevent us from regarding the canonical
ensemble as consisting of an infinity of microcanonical
ensembles.†

* See equation (322).

† An example of the failing case of the microcanonical distribution is
afforded by a material point, under the influence of gravity, and constrained
to remain in a vertical circle. The failing case occurs when the energy is
just sufficient to carry the material point to the highest point of the circle.

It will be observed that the difficulty is inherent in the nature of the case,
and is quite independent of the mathematical formulae. The nature of the
difficulty is at once apparent if we try to distribute a finite number of

From the last equation, with (298), we get

$$\overline{e^{-\phi_p}V_p}|_{\epsilon} = e^{-\phi} \int_{V_q=0}^{\epsilon_q=\epsilon} V_p\,dV_q = e^{-\phi}\,V. \tag{375}$$

But by equations (288) and (289)

$$e^{-\phi_p}\,V_p = \frac{2}{n}\,\epsilon_p. \tag{376}$$

Therefore

$$e^{-\phi}\,V = \overline{e^{-\phi_p}V_p}|_{\epsilon} = \frac{2}{n}\,\overline{\epsilon_p}|_{\epsilon}. \tag{377}$$

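Again, with the aid of equation (301), we get

$$\overline{\frac{d\phi_p}{d\epsilon_p}}\Big|_{\epsilon} = e^{-\phi} \int_{V_q=0}^{\epsilon_q=\epsilon} \frac{d\phi_p}{d\epsilon_p}\,e^{\phi_p}\,dV_q = \frac{d\phi}{d\epsilon}, \tag{378}$$

if $n > 2$. Therefore, by (289),

$$\frac{d\phi}{d\epsilon} = \overline{\frac{d\phi_p}{d\epsilon_p}}\Big|_{\epsilon} = \left(\tfrac12 n - 1\right) \overline{\epsilon_p^{-1}}|_{\epsilon}, \quad \text{if } n > 2. \tag{379}$$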

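These results are interesting on account of the relations of
the functions $e^{-\phi}V$ and $d\phi/d\epsilon$ to the notion of temperature in
thermodynamics, a subject to which we shall return hereafter.
They are particular cases of a general relation easily
deduced from equations (306), (374), (288) and (289). We
have

$$\frac{d^hV}{d\epsilon^h} = \int_{V_q=0}^{\epsilon_q=\epsilon} \frac{d^hV_p}{d\epsilon_p^h}\,dV_q, \quad \text{if } h < \tfrac12 n + 1.$$

The equation may be written

$$e^{-\phi}\frac{d^hV}{d\epsilon^h} = e^{-\phi} \int_{V_q=0}^{\epsilon_q=\epsilon} e^{-\phi_p}\frac{d^hV_p}{d\epsilon_p^h}\,e^{\phi_p}\,dV_q.$$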

material points with this particular value of the energy as nearly as possible
in statistical equilibrium, or if we ask: What is the probability that a point
taken at random from an ensemble in statistical equilibrium with this value
of the energy will be found in any specified part of the circle?

We have therefore

$$e^{-\phi}\frac{d^hV}{d\epsilon^h} = \overline{e^{-\phi_p}\frac{d^hV_p}{d\epsilon_p^h}}\Big|_{\epsilon} = \frac{\Gamma(\tfrac12 n)}{\Gamma(\tfrac12 n - h + 1)}\,\overline{\epsilon_p^{\,1-h}}|_{\epsilon}, \tag{380}$$

if $h < \tfrac12 n + 1$. For example, when $n$ is even, we may make
$h = \tfrac12 n$, which gives, with (307),


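$$(2\pi)^{\frac n2}\,V_q = \Gamma(\tfrac12 n)\,e^{\phi}\,\overline{\epsilon_p^{\,1-\frac n2}}|_{\epsilon}. \tag{381}$$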


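Since any canonical ensemble of systems may be regarded
as composed of microcanonical ensembles, if any quantities
$u$ and $v$ have the same average values in every microcanonical
ensemble, they will have the same values in every canonical
ensemble. To bring equation (380) formally under this rule,
we may observe that the first member being a function of $\epsilon$ is
a constant value in a microcanonical ensemble, and therefore
identical with its average value. We get thus the general
equation

$$\overline{e^{-\phi}\frac{d^hV}{d\epsilon^h}}\Big|_{\Theta} = \overline{e^{-\phi_p}\frac{d^hV_p}{d\epsilon_p^h}}\Big|_{\Theta} = \frac{\Gamma(\tfrac12 n)}{\Gamma(\tfrac12 n - h + 1)}\,\overline{\epsilon_p^{\,1-h}}|_{\Theta}, \tag{382}$$

if $h < \tfrac12 n + 1$.* The equations

$$\Theta = \overline{e^{-\phi}V}|_{\Theta} = \overline{e^{-\phi_p}V_p}|_{\Theta} = \frac{2}{n}\,\overline{\epsilon_p}|_{\Theta}, \tag{383}$$

$$\frac{1}{\Theta} = \overline{\frac{d\phi}{d\epsilon}}\Big|_{\Theta} = \overline{\frac{d\phi_p}{d\epsilon_p}}\Big|_{\Theta} = \left(\tfrac12 n - 1\right) \overline{\epsilon_p^{-1}}|_{\Theta}, \tag{384}$$

may be regarded as particular cases of the general equation.
The last equation is subject to the condition that $n > 2$.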

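The last two equations give for a canonical ensemble,
if $n > 2$,

$$\left(1 - \frac{2}{n}\right) \overline{\epsilon_p}|_{\Theta}\;\overline{\epsilon_p^{-1}}|_{\Theta} = 1. \tag{385}$$

The corresponding equations for a microcanonical ensemble
give, if $n > 2$,

$$\left(1 - \frac{2}{n}\right) \overline{\epsilon_p}|_{\epsilon}\;\overline{\epsilon_p^{-1}}|_{\epsilon} = \frac{d\phi}{d\log V}, \tag{386}$$

which shows that $d\phi/d\log V$ approaches the value unity
when $n$ is very great.

* See equation (202).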
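Equations (377) and (386) lend themselves to a numerical check by random sampling. The sketch below is an illustration under assumptions that are not in the text: the system is taken to be $n$ harmonic oscillators of unit mass and unit frequency, for which the surface of constant energy is a sphere in the $2n$ variables $p$, $q$, the microcanonical distribution is uniform on that sphere, and $V$ is proportional to $\epsilon^n$ (so that $e^{-\phi}V = \epsilon/n$ and $d\phi/d\log V = (n-1)/n$). The function name and its parameters are hypothetical.

```python
import random

def microcanonical_kinetic_averages(n, energy=1.0, samples=40000, seed=0):
    """Estimate the microcanonical averages of eps_p and 1/eps_p for n
    ASSUMED unit-mass, unit-frequency oscillators with
    (p.p + q.q)/2 = energy, by sampling the energy surface uniformly."""
    rng = random.Random(seed)
    sum_ep = 0.0
    sum_inv_ep = 0.0
    for _ in range(samples):
        # a normalised Gaussian vector is uniformly distributed on the sphere
        x = [rng.gauss(0.0, 1.0) for _ in range(2 * n)]
        r2 = sum(v * v for v in x)
        ep = energy * sum(v * v for v in x[:n]) / r2   # first n entries: momenta
        sum_ep += ep
        sum_inv_ep += 1.0 / ep
    return sum_ep / samples, sum_inv_ep / samples

if __name__ == "__main__":
    n = 8
    mean_ep, mean_inv_ep = microcanonical_kinetic_averages(n)
    print("(2/n) avg(eps_p)      :", 2.0 / n * mean_ep, " vs eps/n =", 1.0 / n)
    print("(1-2/n) avg avg(1/eps):", (1 - 2.0 / n) * mean_ep * mean_inv_ep,
          " vs (n-1)/n =", (n - 1) / n)
```

With $n = 8$ the two estimates should fall close to $1/8$ and $7/8$, up to sampling error. The second value is already near unity and moves closer to it as $n$ is increased, in agreement with the remark following (386).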

If a system consists of two parts, having separate energies,
we may obtain equations similar in form to the preceding,
which relate to the system as thus divided.* We shall
distinguish quantities relating to the parts by letters with
suffixes, the same letters without suffixes relating to the
whole system. The extension-in-phase of the whole system
within any given limits of the energies may be represented by
the double integral

$$\iint dV_1\,dV_2$$

taken within those limits, as appears at once from the definitions
of Chapter VIII. In an ensemble distributed with
uniform density within those limits, and zero density outside,
the average value of any function of $\epsilon_1$ and $\epsilon_2$ is given by the
quotient

$$\frac{\iint u\,dV_1\,dV_2}{\iint dV_1\,dV_2},$$

which may also be written†

$$\frac{\iint u\,e^{\phi_1}\,d\epsilon\,dV_2}{\iint e^{\phi_1}\,d\epsilon\,dV_2}.$$

* If this condition is rigorously fulfilled, the parts will have no influence
on each other, and the ensemble formed by distributing the whole
microcanonically is too arbitrary a conception to have a real interest. The
principal interest of the equations which we shall obtain will be in cases in
which the condition is approximately fulfilled. But for the purposes of a
theoretical discussion, it is of course convenient to make such a condition
absolute. Compare Chapter IV, pp. 35 ff., where a similar condition is
considered in connection with canonical ensembles.

† Where the analytical transformations are identical in form with those
on the preceding pages, it does not appear necessary to give all the steps
with the same detail.

If we make the limits of integration $\epsilon$ and $\epsilon + d\epsilon$, we get the
average value of $u$ in an ensemble in which the whole system
is microcanonically distributed in phase, viz.,

$$\overline{u}|_{\epsilon} = e^{-\phi} \int_{V_2=0}^{\epsilon_2=\epsilon} u\,e^{\phi_1}\,dV_2, \tag{387}$$

where $\phi_1$ and $V_2$ are connected by the equation

$$\epsilon_1 + \epsilon_2 = \text{constant} = \epsilon, \tag{388}$$

and $u$, if given as function of $\epsilon_1$, or of $\epsilon_1$ and $\epsilon_2$, becomes in
virtue of the same equation a function of $\epsilon_2$ alone.*

Thus

$$\overline{e^{-\phi_1}V_1}|_{\epsilon} = e^{-\phi} \int_{V_2=0}^{\epsilon_2=\epsilon} V_1\,dV_2, \tag{389}$$

$$e^{-\phi}\,V = \overline{e^{-\phi_1}V_1}|_{\epsilon} = \overline{e^{-\phi_2}V_2}|_{\epsilon}. \tag{390}$$

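This requires a similar relation for canonical averages

$$\Theta = \overline{e^{-\phi}V}|_{\Theta} = \overline{e^{-\phi_1}V_1}|_{\Theta} = \overline{e^{-\phi_2}V_2}|_{\Theta}. \tag{391}$$

Again

$$\overline{\frac{d\phi_1}{d\epsilon_1}}\Big|_{\epsilon} = e^{-\phi} \int_{V_2=0}^{\epsilon_2=\epsilon} \frac{de^{\phi_1}}{d\epsilon_1}\,dV_2. \tag{392}$$

But if $n_1 > 2$, $e^{\phi_1}$ vanishes for $V_1 = 0$,† and

$$\frac{de^{\phi}}{d\epsilon} = \frac{d}{d\epsilon} \int_{V_2=0}^{\epsilon_2=\epsilon} e^{\phi_1}\,dV_2 = \int_{V_2=0}^{\epsilon_2=\epsilon} \frac{de^{\phi_1}}{d\epsilon_1}\,dV_2. \tag{393}$$

Hence, if $n_1 > 2$, and $n_2 > 2$,

$$\frac{d\phi}{d\epsilon} = \overline{\frac{d\phi_1}{d\epsilon_1}}\Big|_{\epsilon} = \overline{\frac{d\phi_2}{d\epsilon_2}}\Big|_{\epsilon}, \tag{394}$$

* In the applications of the equation (387), we cannot obtain all the results
corresponding to those which we have obtained from equation (374), because
$\phi_p$ is a known function of $\epsilon_p$, while $\phi_1$ must be treated as an arbitrary
function of $\epsilon_1$, or nearly so.

† See Chapter VIII, equations (305) and (316).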





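and

$$\frac{1}{\Theta} = \overline{\frac{d\phi}{d\epsilon}}\Big|_{\Theta} = \overline{\frac{d\phi_1}{d\epsilon_1}}\Big|_{\Theta} = \overline{\frac{d\phi_2}{d\epsilon_2}}\Big|_{\Theta}. \tag{395}$$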


We have compared certain functions of the energy of the
whole system with average values of similar functions of
the kinetic energy of the whole system, and with average
values of similar functions of the whole energy of a part of
the system. We may also compare the same functions with
average values of the kinetic energy of a part of the system.

We shall express the total, kinetic, and potential energies of
the whole system by $\epsilon$, $\epsilon_p$, and $\epsilon_q$, and the kinetic energies of the
parts by $\epsilon_{1p}$ and $\epsilon_{2p}$. These kinetic energies are necessarily
separate: we need not make any supposition concerning potential
energies. The extension-in-phase within any limits which can
be expressed in terms of $\epsilon_q$, $\epsilon_{1p}$, $\epsilon_{2p}$ may be represented in the
notations of Chapter VIII by the triple integral

$$\iiint dV_{1p}\,dV_{2p}\,dV_q$$

taken within those limits. And if an ensemble of systems is
distributed with a uniform density within those limits, the
average value of any function of $\epsilon_q$, $\epsilon_{1p}$, $\epsilon_{2p}$ will be expressed
by the quotient

$$\frac{\iiint u\,dV_{1p}\,dV_{2p}\,dV_q}{\iiint dV_{1p}\,dV_{2p}\,dV_q}, \quad \text{or} \quad \frac{\iiint u\,e^{\phi_{1p}}\,d\epsilon\,dV_{2p}\,dV_q}{\iiint e^{\phi_{1p}}\,d\epsilon\,dV_{2p}\,dV_q}.$$

To get the average value of $u$ for a microcanonical distribution,
we must make the limits $\epsilon$ and $\epsilon + d\epsilon$. The denominator
in this case becomes $e^{\phi}\,d\epsilon$, and we have

$$\overline{u}|_{\epsilon} = e^{-\phi} \int_{V_q=0}^{\epsilon_q=\epsilon} \int_{\epsilon_{2p}=0}^{\epsilon_{2p}=\epsilon-\epsilon_q} u\,e^{\phi_{1p}}\,dV_{2p}\,dV_q, \tag{396}$$

where $\phi_{1p}$, $V_{2p}$, and $V_q$ are connected by the equation

$$\epsilon_{1p} + \epsilon_{2p} + \epsilon_q = \text{constant} = \epsilon.$$

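Accordingly

$$\overline{e^{-\phi_{1p}}V_{1p}}|_{\epsilon} = e^{-\phi} \int_{V_q=0}^{\epsilon_q=\epsilon} \int_{\epsilon_{2p}=0}^{\epsilon_{2p}=\epsilon-\epsilon_q} V_{1p}\,dV_{2p}\,dV_q = e^{-\phi}\,V, \tag{397}$$

and we may write

$$e^{-\phi}\,V = \overline{e^{-\phi_{1p}}V_{1p}}|_{\epsilon} = \overline{e^{-\phi_{2p}}V_{2p}}|_{\epsilon} = \frac{2}{n_1}\,\overline{\epsilon_{1p}}|_{\epsilon} = \frac{2}{n_2}\,\overline{\epsilon_{2p}}|_{\epsilon}, \tag{398}$$

and

$$\Theta = \overline{e^{-\phi}V}|_{\Theta} = \overline{e^{-\phi_{1p}}V_{1p}}|_{\Theta} = \overline{e^{-\phi_{2p}}V_{2p}}|_{\Theta} = \frac{2}{n_1}\,\overline{\epsilon_{1p}}|_{\Theta} = \frac{2}{n_2}\,\overline{\epsilon_{2p}}|_{\Theta}. \tag{399}$$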


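Again, if $n_1 > 2$,

$$\overline{\frac{d\phi_{1p}}{d\epsilon_{1p}}}\Big|_{\epsilon} = e^{-\phi} \int_{V_q=0}^{\epsilon_q=\epsilon} \int_{\epsilon_{2p}=0}^{\epsilon_{2p}=\epsilon-\epsilon_q} \frac{d\phi_{1p}}{d\epsilon_{1p}}\,e^{\phi_{1p}}\,dV_{2p}\,dV_q = \frac{d\phi}{d\epsilon}. \tag{400}$$

Hence, if $n_1 > 2$, and $n_2 > 2$,

$$\frac{d\phi}{d\epsilon} = \overline{\frac{d\phi_{1p}}{d\epsilon_{1p}}}\Big|_{\epsilon} = \overline{\frac{d\phi_{2p}}{d\epsilon_{2p}}}\Big|_{\epsilon} = \left(\tfrac12 n_1 - 1\right) \overline{\epsilon_{1p}^{-1}}|_{\epsilon} = \left(\tfrac12 n_2 - 1\right) \overline{\epsilon_{2p}^{-1}}|_{\epsilon}, \tag{401}$$

$$\frac{1}{\Theta} = \overline{\frac{d\phi}{d\epsilon}}\Big|_{\Theta} = \overline{\frac{d\phi_{1p}}{d\epsilon_{1p}}}\Big|_{\Theta} = \overline{\frac{d\phi_{2p}}{d\epsilon_{2p}}}\Big|_{\Theta} = \left(\tfrac12 n_1 - 1\right) \overline{\epsilon_{1p}^{-1}}|_{\Theta} = \left(\tfrac12 n_2 - 1\right) \overline{\epsilon_{2p}^{-1}}|_{\Theta}. \tag{402}$$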

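We cannot apply the methods employed in the preceding
pages to the microcanonical averages of the (generalized)
forces $A_1$, $A_2$, etc., exerted by a system on external bodies,
since these quantities are not functions of the energies, either
kinetic or potential, of the whole or any part of the system.
We may however use the method described on page 116.

Let us imagine an ensemble of systems distributed in phase
according to the index of probability

$$c - \frac{(\epsilon-\epsilon')^2}{\omega^2},$$

where $\epsilon'$ is any constant which is a possible value of the
energy, except only the least value which is consistent with
the values of the external coordinates, and $c$ and $\omega$ are other
constants. We have therefore

$$\int\!\cdots\!\int_{\text{all phases}} e^{c - \frac{(\epsilon-\epsilon')^2}{\omega^2}}\,dp_1 \ldots dq_n = 1, \tag{403}$$

or

$$e^{-c} = \int\!\cdots\!\int_{\text{all phases}} e^{-\frac{(\epsilon-\epsilon')^2}{\omega^2}}\,dp_1 \ldots dq_n, \tag{404}$$

or again

$$e^{-c} = \int_{V=0}^{\epsilon=\infty} e^{-\frac{(\epsilon-\epsilon')^2}{\omega^2}+\phi}\,d\epsilon. \tag{405}$$

From (404) we have

$$\frac{de^{-c}}{da_1} = \int\!\cdots\!\int_{\text{all phases}} \frac{2(\epsilon-\epsilon')}{\omega^2}\,A_1\,e^{-\frac{(\epsilon-\epsilon')^2}{\omega^2}}\,dp_1 \ldots dq_n = \int_{V=0}^{\epsilon=\infty} \frac{2(\epsilon-\epsilon')}{\omega^2}\,\overline{A_1}|_{\epsilon}\,e^{-\frac{(\epsilon-\epsilon')^2}{\omega^2}+\phi}\,d\epsilon, \tag{406}$$

where $\overline{A_1}|_{\epsilon}$ denotes the average value of $A_1$ in those systems
of the ensemble which have any same energy $\epsilon$. (This
is the same thing as the average value of $A_1$ in a microcanonical
ensemble of energy $\epsilon$.) The validity of the transformation
is evident, if we consider separately the part of each integral
which lies between two infinitesimally differing limits of
energy. Integrating by parts, we get

$$\frac{de^{-c}}{da_1} = -\left[\overline{A_1}|_{\epsilon}\,e^{-\frac{(\epsilon-\epsilon')^2}{\omega^2}+\phi}\right]_{V=0}^{\epsilon=\infty} + \int_{V=0}^{\epsilon=\infty} \left(\frac{d\overline{A_1}|_{\epsilon}}{d\epsilon} + \overline{A_1}|_{\epsilon}\frac{d\phi}{d\epsilon}\right) e^{-\frac{(\epsilon-\epsilon')^2}{\omega^2}+\phi}\,d\epsilon. \tag{407}$$

Differentiating (405), we get

$$\frac{de^{-c}}{da_1} = \int_{V=0}^{\epsilon=\infty} \frac{d\phi}{da_1}\,e^{-\frac{(\epsilon-\epsilon')^2}{\omega^2}+\phi}\,d\epsilon - \left(e^{-\frac{(\epsilon-\epsilon')^2}{\omega^2}+\phi}\,\frac{d\epsilon_\alpha}{da_1}\right)_{\epsilon=\epsilon_\alpha}, \tag{408}$$

where $\epsilon_\alpha$ denotes the least value of $\epsilon$ consistent with the external
coordinates. The last term in this equation represents the
part of $de^{-c}/da_1$ which is due to the variation of the lower
limit of the integral. It is evident that the expression in the
brackets will vanish at the upper limit. At the lower limit,
at which $\epsilon_p = 0$, and $\epsilon_q$ has the least value consistent with the
external coordinates, the average sign on $\overline{A_1}|_{\epsilon}$ is superfluous,
as there is but one value of $A_1$, which is represented by
$-d\epsilon_\alpha/da_1$. Exceptions may indeed occur for particular values
of the external coordinates, at which $d\epsilon_\alpha/da_1$ receives a finite
increment, and the formula becomes illusory. Such particular
values we may for the moment leave out of account. The
last term of (408) is therefore equal to the first term of the
second member of (407). (We may observe that both vanish
when $n > 2$ on account of the factor $e^{\phi}$.)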

We have therefore from these equations

$$\int_{V=0}^{\epsilon=\infty} \left(\frac{d\overline{A_1}|_{\epsilon}}{d\epsilon} + \overline{A_1}|_{\epsilon}\frac{d\phi}{d\epsilon} - \frac{d\phi}{da_1}\right) e^{c-\frac{(\epsilon-\epsilon')^2}{\omega^2}+\phi}\,d\epsilon = 0.$$

That is: the average value in the ensemble of the quantity
represented by the principal parenthesis is zero. This must
be true for any value of $\omega$. If we diminish $\omega$, the average
value of the parenthesis at the limit when $\omega$ vanishes becomes
identical with the value for $\epsilon = \epsilon'$. But this may be any value
of the energy, except the least possible. We have therefore

$$\frac{d\overline{A_1}|_{\epsilon}}{d\epsilon} + \overline{A_1}|_{\epsilon}\frac{d\phi}{d\epsilon} - \frac{d\phi}{da_1} = 0,$$

unless it be for the least value of the energy consistent with
the external coordinates, or for particular values of the external
coordinates. But the value of any term of this equation
as determined for particular values of the energy and
of the external coordinates is not distinguishable from its
value as determined for values of the energy and external
coordinates indefinitely near those particular values. The
equation therefore holds without limitation. Multiplying
by $e^{\phi}$, we get

$$e^{\phi}\frac{d\overline{A_1}|_{\epsilon}}{d\epsilon} + \overline{A_1}|_{\epsilon}\frac{de^{\phi}}{d\epsilon} = \frac{de^{\phi}}{da_1} = \frac{d^2V}{da_1\,d\epsilon}.$$

The integral of this equation is

$$\overline{A_1}|_{\epsilon}\,e^{\phi} = \frac{dV}{da_1} + F_1,$$

where $F_1$ is a function of the external coordinates. We have
an equation of this form for each of the external coordinates.
This gives, with (266), for the complete value of the differential
of $V$

$$dV = e^{\phi}\,d\epsilon + \left(e^{\phi}\,\overline{A_1}|_{\epsilon} - F_1\right)da_1 + \left(e^{\phi}\,\overline{A_2}|_{\epsilon} - F_2\right)da_2 + \text{etc.}, \tag{413}$$

or

$$dV = e^{\phi}\left(d\epsilon + \overline{A_1}|_{\epsilon}\,da_1 + \overline{A_2}|_{\epsilon}\,da_2 + \text{etc.}\right) - F_1\,da_1 - F_2\,da_2 - \text{etc.} \tag{414}$$

To determine the values of the functions $F_1$, $F_2$, etc., let
us suppose $a_1$, $a_2$, etc. to vary arbitrarily, while $\epsilon$ varies so
as always to have the least value consistent with the values
of the external coordinates. This will make $V = 0$, and
$dV = 0$. If $n > 2$, we shall have also $e^{\phi} = 0$, which will
give

$$F_1 = 0, \quad F_2 = 0, \quad \text{etc.} \tag{415}$$

The result is the same for any value of $n$. For in the variations
considered the kinetic energy will be constantly zero,
and the potential energy will have the least value consistent
with the external coordinates. The condition of the least
possible potential energy may limit the ensemble at each instant
to a single configuration, or it may not do so; but in any
case the values of $A_1$, $A_2$, etc. will be the same at each instant
for all the systems of the ensemble,* and the equation

$$d\epsilon + A_1\,da_1 + A_2\,da_2 + \text{etc.} = 0$$

will hold for the variations considered. Hence the functions
$F_1$, $F_2$, etc. vanish in any case, and we have the equation

$$dV = e^{\phi}\,d\epsilon + e^{\phi}\,\overline{A_1}|_{\epsilon}\,da_1 + e^{\phi}\,\overline{A_2}|_{\epsilon}\,da_2 + \text{etc.}, \tag{416}$$

or

$$d\log V = \frac{d\epsilon + \overline{A_1}|_{\epsilon}\,da_1 + \overline{A_2}|_{\epsilon}\,da_2 + \text{etc.}}{e^{-\phi}\,V}, \tag{417}$$

or again

$$d\epsilon = e^{-\phi}\,V\,d\log V - \overline{A_1}|_{\epsilon}\,da_1 - \overline{A_2}|_{\epsilon}\,da_2 - \text{etc.} \tag{418}$$

It will be observed that the two last equations have the form
of the fundamental differential equations of thermodynamics,
$e^{-\phi}V$ corresponding to temperature and $\log V$ to entropy.
We have already observed properties of $e^{-\phi}V$ suggestive of an
analogy with temperature.† The significance of these facts
will be discussed in another chapter.
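The thermodynamic form of (417) may be made concrete by a worked case. The following is offered only as an illustration under assumptions not made in the text: a system of $N$ free material points confined to a vessel of volume $a$, so that $n = 3N$, with a single external coordinate $a_1 = a$ and $V = C\,a^N \epsilon^{3N/2}$, where $C$ is a constant.

```latex
% Assumed extension-in-phase of N free points in a vessel of volume a:
%   V = C a^{N} \epsilon^{3N/2}, with n = 3N and C a constant.
\begin{align*}
\log V &= N \log a + \tfrac{3N}{2} \log \epsilon + \log C, \\
e^{-\phi} V &= \frac{V}{dV/d\epsilon} = \frac{2\epsilon}{3N}, \\
d \log V &= \frac{3N}{2\epsilon}\, d\epsilon + \frac{N}{a}\, da
          = \frac{d\epsilon + \overline{A}|_{\epsilon}\, da}{e^{-\phi} V}
          \quad\Longrightarrow\quad
          \overline{A}|_{\epsilon} = \frac{2\epsilon}{3N} \cdot \frac{N}{a}
          = \frac{2\epsilon}{3a}.
\end{align*}
```

In this assumed case the average force corresponding to the volume is two thirds of the energy per unit volume, and $\overline{A}|_{\epsilon}\,a = N\,e^{-\phi}V$, which has the form of the law of a perfect gas if $e^{-\phi}V$ is read as the analogue of temperature.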

The two last equations might be written more simply

$$dV = \frac{d\epsilon + \overline{A_1}|_{\epsilon}\,da_1 + \overline{A_2}|_{\epsilon}\,da_2 + \text{etc.}}{e^{-\phi}},$$

$$d\epsilon = e^{-\phi}\,dV - \overline{A_1}|_{\epsilon}\,da_1 - \overline{A_2}|_{\epsilon}\,da_2 - \text{etc.},$$

and still have the form analogous to the thermodynamic
equations, but $e^{-\phi}$ has nothing like the analogies with temperature
which we have observed in $e^{-\phi}V$.


* This statement, as mentioned before, may have exceptions for particular
values of the external coordinates. This will not invalidate the reasoning,
which has to do with varying values of the external coordinates.
† See Chapter IX, page 111; also this chapter, page 119.



CHAPTER XI.


MAXIMUM AND MINIMUM PROPERTIES OF VARIOUS
DISTRIBUTIONS IN PHASE.

In the following theorems we suppose, as always, that the 
systems forming an ensemble are identical in nature and in 
the values of the external coordinates, which are here regarded 
as constants. 

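Theorem I. If an ensemble of systems is so distributed in 
phase that the index of probability is a function of the energy, 
the average value of the index is less than for any other 
distribution in which the distribution in energy is unaltered. 

Let us write $\eta$ for the index which is a function of the 
energy, and $\eta + \Delta\eta$ for any other which gives the same 
distribution in energy. It is to be proved that 

$$\int\cdots\int_{\text{all phases}} (\eta + \Delta\eta)\,e^{\eta+\Delta\eta}\,dp_1 \ldots dq_n > \int\cdots\int_{\text{all phases}} \eta\,e^{\eta}\,dp_1 \ldots dq_n, \tag{419}$$

where $\eta$ is a function of the energy, and $\Delta\eta$ a function of the 
phase, which are subject to the conditions that 

$$\int\cdots\int_{\text{all phases}} e^{\eta+\Delta\eta}\,dp_1 \ldots dq_n = \int\cdots\int_{\text{all phases}} e^{\eta}\,dp_1 \ldots dq_n = 1, \tag{420}$$

and that for any value of the energy ($\epsilon'$) 

$$\int\cdots\int_{\epsilon=\epsilon'}^{\epsilon=\epsilon'+d\epsilon'} e^{\eta+\Delta\eta}\,dp_1 \ldots dq_n = \int\cdots\int_{\epsilon=\epsilon'}^{\epsilon=\epsilon'+d\epsilon'} e^{\eta}\,dp_1 \ldots dq_n. \tag{421}$$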

Equation (420) expresses the general relations which $\eta$ and 
$\eta + \Delta\eta$ must satisfy in order to be indices of any distributions, 
and (421) expresses the condition that they give the same 
distribution in energy. 

Since $\eta$ is a function of the energy, and may therefore be 
regarded as a constant within the limits of integration of (421), 
we may multiply by $\eta$ under the integral sign in both members, 
which gives 

$$\int\cdots\int_{\epsilon=\epsilon'}^{\epsilon=\epsilon'+d\epsilon'} \eta\,e^{\eta+\Delta\eta}\,dp_1 \ldots dq_n = \int\cdots\int_{\epsilon=\epsilon'}^{\epsilon=\epsilon'+d\epsilon'} \eta\,e^{\eta}\,dp_1 \ldots dq_n.$$

Since this is true within the limits indicated, and for every 
value of $\epsilon'$, it will be true if the integrals are taken for all 
phases. We may therefore cancel the corresponding parts of 
(419), which gives 

$$\int\cdots\int_{\text{all phases}} \Delta\eta\,e^{\eta+\Delta\eta}\,dp_1 \ldots dq_n > 0. \tag{422}$$

But by (420) this is equivalent to 

$$\int\cdots\int_{\text{all phases}} \left(\Delta\eta\,e^{\Delta\eta} + 1 - e^{\Delta\eta}\right) e^{\eta}\,dp_1 \ldots dq_n > 0. \tag{423}$$

Now $\Delta\eta\,e^{\Delta\eta} + 1 - e^{\Delta\eta}$ is a decreasing function of $\Delta\eta$ for 
negative values of $\Delta\eta$, and an increasing function of $\Delta\eta$ for 
positive values of $\Delta\eta$. It vanishes for $\Delta\eta = 0$. The expression is 
therefore incapable of a negative value, and can have the value 
0 only for $\Delta\eta = 0$. The inequality (423) will hold therefore 
unless $\Delta\eta = 0$ for all phases. The theorem is therefore 
proved. 
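
The property of the expression in the parenthesis of (423) on which the proof rests can be checked numerically. The following is a minimal sketch, not part of the original argument; the names `g`, `g_slope` and `grid` are introduced here for illustration, and `x` stands for the difference of indices.

```python
import math


def g(x: float) -> float:
    """x*e^x + 1 - e^x, with x standing for the difference of indices."""
    return x * math.exp(x) + 1.0 - math.exp(x)


def g_slope(x: float) -> float:
    """Derivative of g with respect to x, which simplifies to x*e^x."""
    return x * math.exp(x)


# Sample the expression on a grid of values from -5 to 5 in steps of 0.1.
grid = [k / 10.0 for k in range(-50, 51)]
values = [g(x) for x in grid]
smallest = min(values)  # the smallest sampled value, attained at x = 0
```

The slope has the sign of `x`, which is the decreasing and increasing behavior asserted in the text, and the sampled values are nowhere negative.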

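Theorem II. If an ensemble of systems is canonically 
distributed in phase, the average index of probability is less than 
in any other distribution of the ensemble having the same 
average energy. 

For the canonical distribution let the index be $(\psi - \epsilon)/\Theta$, 
and for another having the same average energy let the index 
be $(\psi - \epsilon)/\Theta + \Delta\eta$, where $\Delta\eta$ is an arbitrary function of the 
phase subject only to the limitation involved in the notion of 
the index, that 

$$\int\cdots\int_{\text{all phases}} e^{\frac{\psi-\epsilon}{\Theta}+\Delta\eta}\,dp_1 \ldots dq_n = \int\cdots\int_{\text{all phases}} e^{\frac{\psi-\epsilon}{\Theta}}\,dp_1 \ldots dq_n = 1, \tag{424}$$

and to that relating to the constant average energy, that 

$$\int\cdots\int_{\text{all phases}} \epsilon\,e^{\frac{\psi-\epsilon}{\Theta}+\Delta\eta}\,dp_1 \ldots dq_n = \int\cdots\int_{\text{all phases}} \epsilon\,e^{\frac{\psi-\epsilon}{\Theta}}\,dp_1 \ldots dq_n. \tag{425}$$

It is to be proved that 

$$\int\cdots\int_{\text{all phases}} \left(\frac{\psi-\epsilon}{\Theta} + \Delta\eta\right) e^{\frac{\psi-\epsilon}{\Theta}+\Delta\eta}\,dp_1 \ldots dq_n > \int\cdots\int_{\text{all phases}} \frac{\psi-\epsilon}{\Theta}\,e^{\frac{\psi-\epsilon}{\Theta}}\,dp_1 \ldots dq_n. \tag{426}$$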

Now in virtue of the first condition (424) we may cancel the 
constant term $\psi/\Theta$ in the parentheses in (426), and in virtue 
of the second condition (425) we may cancel the term $\epsilon/\Theta$. 
The proposition to be proved is thus reduced to 

$$\int\cdots\int_{\text{all phases}} \Delta\eta\,e^{\frac{\psi-\epsilon}{\Theta}+\Delta\eta}\,dp_1 \ldots dq_n > 0,$$

which may be written, in virtue of the condition (424), 

$$\int\cdots\int_{\text{all phases}} \left(\Delta\eta\,e^{\Delta\eta} + 1 - e^{\Delta\eta}\right) e^{\frac{\psi-\epsilon}{\Theta}}\,dp_1 \ldots dq_n > 0. \tag{427}$$

In this form its truth is evident for the same reasons which 
applied to (423). 
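
A discrete analogue may make Theorem II concrete. The sketch below is an illustration under stated assumptions and not the author's method: the integral over phases is replaced by a sum over three states, and the energy levels, the modulus `theta` and the perturbing `direction` are all chosen here for the example. The direction is picked so that it alters neither the total probability nor the average energy, which are the conditions (424) and (425).

```python
import math


def average_index(probs):
    """Average index of probability, the sum of P*log(P) over the states."""
    return sum(p * math.log(p) for p in probs if p > 0.0)


def mean_energy(probs, energies):
    """Average energy of a discrete distribution."""
    return sum(p * e for p, e in zip(probs, energies))


energies = [0.0, 1.0, 2.0]  # assumed energy levels
theta = 1.5                 # assumed modulus
weights = [math.exp(-e / theta) for e in energies]
canonical = [w / sum(weights) for w in weights]

# This direction sums to zero and is orthogonal to the energies, so adding
# a multiple of it changes neither the normalization nor the mean energy.
direction = [1.0, -2.0, 1.0]


def perturbed(t):
    """Another distribution with the same average energy as the canonical one."""
    return [p + t * d for p, d in zip(canonical, direction)]


# Excess of the average index over its canonical value for several perturbations.
excess = {t: average_index(perturbed(t)) - average_index(canonical)
          for t in (-0.10, -0.05, 0.05, 0.10)}
```

Every entry of `excess` should be positive, the canonical distribution having the least average index among those compared.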

Theorem III. If $\Theta$ is any positive constant, the average 
value in an ensemble of the expression $\eta + \epsilon/\Theta$ ($\eta$ denoting 
as usual the index of probability and $\epsilon$ the energy) is less when 
the ensemble is distributed canonically with modulus $\Theta$, than 
for any other distribution whatever. 

In accordance with our usual notation let us write 
$(\psi - \epsilon)/\Theta$ for the index of the canonical distribution. In any 
other distribution let the index be $(\psi - \epsilon)/\Theta + \Delta\eta$. 
In the canonical ensemble $\eta + \epsilon/\Theta$ has the constant value 
$\psi/\Theta$; in the other ensemble it has the value $\psi/\Theta + \Delta\eta$. 
The proposition to be proved may therefore be written 

$$\frac{\psi}{\Theta} < \int\cdots\int_{\text{all phases}} \left(\frac{\psi}{\Theta} + \Delta\eta\right) e^{\frac{\psi-\epsilon}{\Theta}+\Delta\eta}\,dp_1 \ldots dq_n, \tag{428}$$

where 

$$\int\cdots\int_{\text{all phases}} e^{\frac{\psi-\epsilon}{\Theta}+\Delta\eta}\,dp_1 \ldots dq_n = \int\cdots\int_{\text{all phases}} e^{\frac{\psi-\epsilon}{\Theta}}\,dp_1 \ldots dq_n = 1. \tag{429}$$

In virtue of this condition, since $\psi/\Theta$ is constant, the 
proposition to be proved reduces to 

$$0 < \int\cdots\int_{\text{all phases}} \Delta\eta\,e^{\frac{\psi-\epsilon}{\Theta}+\Delta\eta}\,dp_1 \ldots dq_n, \tag{430}$$

where the demonstration may be concluded as in the last 
theorem. 
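
Theorem III may likewise be tried on a small discrete example. This is only a sketch under assumptions made here: four states with the energies listed, a modulus `theta`, and a handful of comparison distributions chosen arbitrarily. In the canonical case the quantity averaged has the constant value $\psi/\Theta$, which for a sum over states is the negative logarithm of the normalizing sum.

```python
import math

energies = [0.0, 0.5, 1.0, 3.0]  # assumed energy levels
theta = 0.8                      # assumed modulus


def average_eta_plus_energy(probs):
    """Average of eta + energy/theta over the distribution, with eta = log(P)."""
    return sum(p * (math.log(p) + e / theta)
               for p, e in zip(probs, energies) if p > 0.0)


# Normalizing sum of the canonical distribution and the constant psi/theta.
partition = sum(math.exp(-e / theta) for e in energies)
psi_over_theta = -math.log(partition)
canonical = [math.exp(-e / theta) / partition for e in energies]

# Arbitrary comparison distributions (not restricted in average energy).
others = [
    [0.25, 0.25, 0.25, 0.25],
    [0.70, 0.10, 0.10, 0.10],
    [0.10, 0.20, 0.30, 0.40],
    [1.00, 0.00, 0.00, 0.00],
]
canonical_value = average_eta_plus_energy(canonical)
other_values = [average_eta_plus_energy(p) for p in others]
```

Unlike the example for Theorem II, the comparison distributions here are not constrained to share the canonical average energy, since the theorem compares against any other distribution whatever.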

If we should substitute for the energy in the preceding 
theorems any other function of the phase, the theorems, 
mutatis mutandis, would still hold. On account of the unique 
importance of the energy as a function of the phase, the 
theorems as given are especially worthy of notice. When the case 
is such that other functions of the phase have important 
properties relating to statistical equilibrium, as described 
in Chapter IV,* the three following theorems, which are 
generalizations of the preceding, may be useful. It will be 
sufficient to give them without demonstration, as the principles 
involved are in no respect different. 

Theorem IV. If an ensemble of systems is so distributed in 
phase that the index of probability is any function of $F_1$, $F_2$, 
etc., (these letters denoting functions of the phase,) the average 
value of the index is less than for any other distribution in 
phase in which the distribution with respect to the functions 
$F_1$, $F_2$, etc. is unchanged. 


* See pages 37-41. 

Theorem V. If an ensemble of systems is so distributed 
in phase that the index of probability is a linear function of 
$F_1$, $F_2$, etc., (these letters denoting functions of the phase,) the 
average value of the index is less than for any other 
distribution in which the functions $F_1$, $F_2$, etc. have the same average 
values. 

Theorem VI. The average value in an ensemble of systems 
of $\eta + F$ (where $\eta$ denotes as usual the index of probability and 
$F$ any function of the phase) is less when the ensemble is so 
distributed that $\eta + F$ is constant than for any other 
distribution whatever. 

Theorem VII. If a system which in its different phases 
constitutes an ensemble consists of two parts, and we consider 
the average index of probability for the whole system, and 
also the average indices for each of the parts taken separately, 
the sum of the average indices for the parts will be either less 
than the average index for the whole system, or equal to it, 
but cannot be greater. The limiting case of equality occurs 
when the distribution in phase of each part is independent of 
that of the other, and only in this case. 

Let the coordinates and momenta of the whole system be 
$q_1, \ldots q_n, p_1, \ldots p_n$, of which $q_1, \ldots q_m, p_1, \ldots p_m$ relate to one 
part of the system, and $q_{m+1}, \ldots q_n, p_{m+1}, \ldots p_n$ to the other. 

If the index of probability for the whole system is denoted by 
$\eta$, the probability that the phase of an unspecified system lies 
within any given limits is expressed by the integral 

$$\int\cdots\int e^{\eta}\,dp_1 \ldots dq_n \tag{431}$$

taken for those limits. If we set 

$$\int\cdots\int e^{\eta}\,dp_{m+1} \ldots dq_n = e^{\eta_1}, \tag{432}$$

where the integrations cover all phases of the second system, 
and 

$$\int\cdots\int e^{\eta}\,dp_1 \ldots dq_m = e^{\eta_2}, \tag{433}$$

where the integrations cover all phases of the first system, 
the integral (431) will reduce to the form 

$$\int\cdots\int e^{\eta_1}\,dp_1 \ldots dp_m\,dq_1 \ldots dq_m \tag{434}$$

when the limits can be expressed in terms of the coordinates 
and momenta of the first part of the system. The same integral 
will reduce to 

$$\int\cdots\int e^{\eta_2}\,dp_{m+1} \ldots dp_n\,dq_{m+1} \ldots dq_n, \tag{435}$$

when the limits can be expressed in terms of the coordinates 
and momenta of the second part of the system. It is evident 
that $\eta_1$ and $\eta_2$ are the indices of probability for the two parts 
of the system taken separately. 

The main proposition to be proved may be written 

$$\int\cdots\int \eta_1\,e^{\eta_1}\,dp_1 \ldots dq_m + \int\cdots\int \eta_2\,e^{\eta_2}\,dp_{m+1} \ldots dq_n \le \int\cdots\int \eta\,e^{\eta}\,dp_1 \ldots dq_n, \tag{436}$$

where the first integral is to be taken over all phases of the first 
part of the system, the second integral over all phases of the 
second part of the system, and the last integral over all phases 
of the whole system. Now we have 

$$\int\cdots\int e^{\eta}\,dp_1 \ldots dq_n = 1, \tag{437}$$

$$\int\cdots\int e^{\eta_1}\,dp_1 \ldots dq_m = 1, \tag{438}$$

and 

$$\int\cdots\int e^{\eta_2}\,dp_{m+1} \ldots dq_n = 1, \tag{439}$$

where the limits cover in each case all the phases to which the 
variables relate. The two last equations, which are in 
themselves evident, may be derived by partial integration from the 
first. 

It appears from the definitions of $\eta_1$ and $\eta_2$ that (436) may 
also be written 

$$\int\cdots\int \eta_1\,e^{\eta}\,dp_1 \ldots dq_n + \int\cdots\int \eta_2\,e^{\eta}\,dp_1 \ldots dq_n \le \int\cdots\int \eta\,e^{\eta}\,dp_1 \ldots dq_n, \tag{440}$$

or 

$$\int\cdots\int (\eta - \eta_1 - \eta_2)\,e^{\eta}\,dp_1 \ldots dq_n \ge 0,$$

where the integrations cover all phases. Adding the equation 

$$\int\cdots\int e^{\eta_1+\eta_2}\,dp_1 \ldots dq_n = 1, \tag{441}$$

which we get by multiplying (438) and (439), and subtracting 
(437), we have for the proposition to be proved 

$$\int\cdots\int_{\text{all phases}} \left[(\eta - \eta_1 - \eta_2)\,e^{\eta} + e^{\eta_1+\eta_2} - e^{\eta}\right] dp_1 \ldots dq_n \ge 0. \tag{442}$$

Let 

$$u = \eta - \eta_1 - \eta_2. \tag{443}$$

The main proposition to be proved may be written 

$$\int\cdots\int_{\text{all phases}} \left(u\,e^{u} + 1 - e^{u}\right) e^{\eta_1+\eta_2}\,dp_1 \ldots dq_n \ge 0. \tag{444}$$

This is evidently true since the quantity in the parenthesis is 
incapable of a negative value.* Moreover the sign = can 
hold only when the quantity in the parenthesis vanishes for 
all phases, i. e., when $u = 0$ for all phases. This makes 
$\eta = \eta_1 + \eta_2$ for all phases, which is the analytical condition 
which expresses that the distributions in phase of the two 
parts of the system are independent. 
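
The inequality of Theorem VII can be seen in a small table of joint probabilities. The sketch below is an illustration only: sums over discrete states stand in for the integrals, and the numbers in `joint` are assumed for the example. Summing over the states of one part gives the probabilities of the other, in the manner of (432) and (433).

```python
import math


def average_index(probs):
    """Average index of probability, the sum of P*log(P) over the states."""
    return sum(p * math.log(p) for p in probs if p > 0.0)


# Assumed joint probabilities: rows are states of the first part,
# columns are states of the second part.
joint = [
    [0.30, 0.10, 0.05],
    [0.05, 0.20, 0.30],
]
first_part = [sum(row) for row in joint]         # plays the role of e^{eta_1}
second_part = [sum(col) for col in zip(*joint)]  # plays the role of e^{eta_2}

whole = average_index([p for row in joint for p in row])
parts = average_index(first_part) + average_index(second_part)

# Independent case: the joint probability is the product of those of the parts.
independent = [[a * b for b in second_part] for a in first_part]
whole_independent = average_index([p for row in independent for p in row])
```

Here `parts` should fall below `whole`, while for the product table the two agree, which is the limiting case of equality.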

* See Theorem I, where this is proved of a similar expression. 

Theorem VIII. If two or more ensembles of systems which 
are identical in nature, but may be distributed differently in 
phase, are united to form a single ensemble, so that the 
probability-coefficient of the resulting ensemble is a linear function 
of the probability-coefficients of the original ensembles, the 
average index of probability of the resulting ensemble cannot 
be greater than the same linear function of the average indices 
of the original ensembles. It can be equal to it only when 
the original ensembles are similarly distributed in phase. 

Let $P_1$, $P_2$, etc. be the probability-coefficients of the original 
ensembles, and $P$ that of the ensemble formed by combining 
them; and let $N_1$, $N_2$, etc. be the numbers of systems in the 
original ensembles. It is evident that we shall have 

$$P = c_1 P_1 + c_2 P_2 + \text{etc.} = \Sigma (c_1 P_1), \tag{445}$$

where 

$$c_1 = \frac{N_1}{\Sigma N_1}, \quad c_2 = \frac{N_2}{\Sigma N_1}, \quad \text{etc.} \tag{446}$$

The main proposition to be proved is that 

$$\int\cdots\int_{\text{all phases}} P \log P\,dp_1 \ldots dq_n \le \Sigma \left[ c_1 \int\cdots\int_{\text{all phases}} P_1 \log P_1\,dp_1 \ldots dq_n \right], \tag{447}$$

or 

$$\int\cdots\int_{\text{all phases}} \left[\Sigma (c_1 P_1 \log P_1) - P \log P\right] dp_1 \ldots dq_n \ge 0. \tag{448}$$

If we set 

$$Q_1 = P_1 \log P_1 - P_1 \log P - P_1 + P,$$

$Q_1$ will be positive, except when it vanishes for $P_1 = P$. To 
prove this, we may regard $P_1$ and $P$ as any positive quantities. 
Then 

$$\left(\frac{dQ_1}{dP_1}\right)_P = \log P_1 - \log P,$$

$$\left(\frac{d^2 Q_1}{dP_1^2}\right)_P = \frac{1}{P_1}.$$

Since $Q_1$ and $dQ_1/dP_1$ vanish for $P_1 = P$, and the second 
differential coefficient is always positive, $Q_1$ must be positive 
except when $P_1 = P$. Therefore, if $Q_2$, etc. have similar 
definitions, 

$$\Sigma (c_1 Q_1) \ge 0. \tag{449}$$

But since $\Sigma (c_1 P_1) = P$ 

and $\Sigma c_1 = 1$, 

$$\Sigma (c_1 Q_1) = \Sigma (c_1 P_1 \log P_1) - P \log P. \tag{450}$$

This proves (448), and shows that the sign = will hold only 
when 

$$P_1 = P, \quad P_2 = P, \quad \text{etc.}$$

for all phases, i. e., only when the distributions in phase of the 
original ensembles are all identical. 
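
The steps of this proof can be followed numerically on two small ensembles. The sketch below is illustrative only: the distributions `first` and `second` over three states and the numbers of systems are assumed, and sums over states replace the integrals over phases. The function `q` is the quantity $Q_1$ of the text.

```python
import math


def average_index(probs):
    """Average index of probability, the sum of P*log(P) over the states."""
    return sum(p * math.log(p) for p in probs if p > 0.0)


def q(p_i, p):
    """Q = P_i*log(P_i) - P_i*log(P) - P_i + P, for positive P_i and P."""
    return p_i * math.log(p_i) - p_i * math.log(p) - p_i + p


first = [0.7, 0.2, 0.1]    # assumed probability-coefficients, first ensemble
second = [0.1, 0.3, 0.6]   # assumed probability-coefficients, second ensemble
numbers = [300, 100]       # assumed numbers of systems
c = [n / sum(numbers) for n in numbers]

# Probability-coefficient of the combined ensemble, as in (445).
combined = [c[0] * p1 + c[1] * p2 for p1, p2 in zip(first, second)]

combined_index = average_index(combined)
linear_function = c[0] * average_index(first) + c[1] * average_index(second)

# Sum of c_i * Q_i over all states, the discrete counterpart of (449) and (450).
q_total = sum(c[0] * q(p1, p) + c[1] * q(p2, p)
              for p1, p2, p in zip(first, second, combined))
```

The quantity `q_total` should equal the difference `linear_function - combined_index`, and being a sum of terms none of which is negative it shows the inequality (447) directly.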

Theorem IX. A uniform distribution of a given number of 
systems within given limits of phase gives a less average index 
of probability of phase than any other distribution. 

Let $\eta$ be the constant index of the uniform distribution, and 
$\eta + \Delta\eta$ the index of some other distribution. Since the 
number of systems within the given limits is the same in the two 
distributions we have 

$$\int\cdots\int e^{\eta+\Delta\eta}\,dp_1 \ldots dq_n = \int\cdots\int e^{\eta}\,dp_1 \ldots dq_n, \tag{451}$$

where the integrations, like those which follow, are to be 
taken within the given limits. The proposition to be proved 
may be written 

$$\int\cdots\int (\eta + \Delta\eta)\,e^{\eta+\Delta\eta}\,dp_1 \ldots dq_n > \int\cdots\int \eta\,e^{\eta}\,dp_1 \ldots dq_n, \tag{452}$$

or, since $\eta$ is constant, 

$$\int\cdots\int (\eta + \Delta\eta)\,e^{\Delta\eta}\,dp_1 \ldots dq_n > \int\cdots\int \eta\,dp_1 \ldots dq_n. \tag{453}$$

In (451) also we may cancel the constant factor $e^{\eta}$ and multiply 
by the constant factor $(\eta + 1)$. This gives 

$$\int\cdots\int (\eta + 1)\,e^{\Delta\eta}\,dp_1 \ldots dq_n = \int\cdots\int (\eta + 1)\,dp_1 \ldots dq_n.$$

The subtraction of this equation will not alter the inequality 
to be proved, which may therefore be written 

$$\int\cdots\int (\Delta\eta - 1)\,e^{\Delta\eta}\,dp_1 \ldots dq_n > \int\cdots\int -\,dp_1 \ldots dq_n,$$

or 

$$\int\cdots\int \left(\Delta\eta\,e^{\Delta\eta} - e^{\Delta\eta} + 1\right) dp_1 \ldots dq_n > 0. \tag{454}$$

Since the parenthesis in this expression represents a positive 
value, except when it vanishes for $\Delta\eta = 0$, the integral will 
be positive unless $\Delta\eta$ vanishes everywhere within the limits, 
which would make the difference of the two distributions 
vanish. The theorem is therefore proved. 
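
As a rough check of Theorem IX, one may divide the given limits of phase into a number of equal cells and compare the uniform distribution with others drawn at random. This is a sketch under assumptions made here (six cells, a seeded random generator, one thousand trials) and is no substitute for the proof.

```python
import math
import random


def average_index(probs):
    """Average index of probability, the sum of P*log(P) over the cells."""
    return sum(p * math.log(p) for p in probs if p > 0.0)


cells = 6  # assumed number of equal cells within the given limits
uniform = [1.0 / cells] * cells
uniform_value = average_index(uniform)

rng = random.Random(0)  # seeded so that the trial is repeatable


def random_distribution():
    """A distribution over the cells with arbitrary positive weights."""
    raw = [rng.random() + 1e-9 for _ in range(cells)]
    total = sum(raw)
    return [x / total for x in raw]


trial_values = [average_index(random_distribution()) for _ in range(1000)]
```

For the uniform case the average index is the negative logarithm of the number of cells, and each of the trial values should exceed it.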




CHAPTER XII. 


ON THE MOTION OF SYSTEMS AND ENSEMBLES OF SYSTEMS 
THROUGH LONG PERIODS OF TIME. 

An important question which suggests itself in regard to any 
case of dynamical motion is whether the system considered 
will return in the course of time to its initial phase, or, if it 
will not return exactly to that phase, whether it will do so to 
any required degree of approximation in the course of a 
sufficiently long time. To be able to give even a partial answer 
to such questions, we must know something in regard to the 
dynamical nature of the system. In the following theorem, 
the only assumption in this respect is such as we have found 
necessary for the existence of the canonical distribution. 

If we imagine an ensemble of identical systems to be 
distributed with a uniform density throughout any finite 
extension-in-phase, the number of the systems which leave 
the extension-in-phase and will not return to it in the course 
of time is less than any assignable fraction of the whole 
number; provided, that the total extension-in-phase for the 
systems considered between two limiting values of the energy 
is finite, these limiting values being less and greater 
respectively than any of the energies of the first-mentioned 
extension-in-phase. 

To prove this, we observe that at the moment which we 
call initial the systems occupy the given extension-in-phase. 
It is evident that some systems must leave the extension 
immediately, unless all remain in it forever. Those systems 
which leave the extension at the first instant, we shall call 
the front of the ensemble. It will be convenient to speak of 
this front as generating the extension-in-phase through which it 
passes in the course of time, as in geometry a surface is said to 
generate the volume through which it passes. In equal times 
the front generates equal extensions in phase. This is an 
immediate consequence of the principle of conservation of 
extension-in-phase, unless indeed we prefer to consider it as 
a slight variation in the expression of that principle. For in 
two equal short intervals of time let the extensions generated 
be $A$ and $B$. (We make the intervals short simply to avoid 
the complications in the enunciation or interpretation of the 
principle which would arise when the same extension-in-phase 
is generated more than once in the interval considered.) Now 
if we imagine that at a given instant systems are distributed 
throughout the extension $A$, it is evident that the same 
systems will after a certain time occupy the extension $B$, 
which is therefore equal to $A$ in virtue of the principle cited. 
The front of the ensemble, therefore, goes on generating 
equal extensions in equal times. But these extensions are 
included in a finite extension, viz., that bounded by certain 
limiting values of the energy. Sooner or later, therefore, 
the front must generate phases which it has before generated. 
Such second generation of the same phases must commence 
with the initial phases. Therefore a portion at least of the 
front must return to the original extension-in-phase. The 
same is of course true of the portion of the ensemble which 
follows that portion of the front through the same phases at 
a later time. 

It remains to consider how large the portion of the ensemble 
is, which will return to the original extension-in-phase. There 
can be no portion of the given extension-in-phase, the systems 
of which leave the extension and do not return. For we can 
prove for any portion of the extension as for the whole, that 
at least a portion of the systems leaving it will return. 
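
The argument from a finite total extension has a simple discrete counterpart, offered here only as an illustration. The map used below (Arnold's cat map on a square lattice of side `n`) is not taken from the text; it is chosen because it is one-to-one on the lattice, which is the discrete analogue of conservation of extension-in-phase. The lattice size and the `region` are assumptions of the example.

```python
def cat_map(point, n):
    """One step of the cat map on an n-by-n lattice; a one-to-one map."""
    x, y = point
    return ((2 * x + y) % n, (x + y) % n)


def first_return_time(start, region, n, max_steps):
    """Number of steps after which a point starting in `region` is in it again."""
    point = cat_map(start, n)
    steps = 1
    while point not in region:
        if steps >= max_steps:
            return None  # no return found within the allowed number of steps
        point = cat_map(point, n)
        steps += 1
    return steps


n = 20
lattice = [(x, y) for x in range(n) for y in range(n)]
images = {cat_map(p, n) for p in lattice}  # every lattice point should appear once

# An assumed small block of the lattice, standing for the given extension.
region = {(x, y) for x in range(3, 6) for y in range(7, 10)}
return_times = {p: first_return_time(p, region, n, n * n) for p in region}
```

Because the map is one-to-one on a finite set, no point can wander indefinitely through fresh positions, and every point of the block comes back to it. In the continuous case the exceptions form no assignable fraction of the whole rather than being strictly absent.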

We may divide the given extension-in-phase into parts as 
follows. There may be parts such that the systems within 
them will never pass out of them. These parts may indeed 
constitute the whole of the given extension. But if the given 
extension is very small, these parts will in general be 
non-existent. There may be parts such that systems within them 
will all pass out of the given extension and all return within 
it. The whole of the given extension-in-phase is made up of 
parts of these two kinds. This does not exclude the 
possibility of phases on the boundaries of such parts, such that 
systems starting with those phases would leave the extension 
and never return. But in the supposed distribution of an 
ensemble of systems with a uniform density-in-phase, such 
systems would not constitute any assignable fraction of the 
whole number. 

These distinctions may be illustrated by a very simple 
example. If we consider the motion of a rigid body of 
which one point is fixed, and which is subject to no forces, 
we find three cases. (1) The motion is periodic. (2) The 
system will never return to its original phase, but will return 
infinitely near to it. (3) The system will never return either 
exactly or approximately to its original phase. But if we 
consider any extension-in-phase, however small, a system 
leaving that extension will return to it except in the case 
called by Poinsot 'singular,' viz., when the motion is a 
rotation about an axis lying in one of two planes having 
a fixed position relative to the rigid body. But all such 
phases do not constitute any true extension-in-phase in the 
sense in which we have defined and used the term.* 

* An ensemble of systems distributed in phase is a less simple and 
elementary conception than a single system. But by the consideration of 
suitable ensembles instead of single systems, we may get rid of the 
inconvenience of having to consider exceptions formed by particular cases of the 
integral equations of motion, these cases simply disappearing when the 
ensemble is substituted for the single system as a subject of study. This 
is especially true when the ensemble is distributed, as in the case called 
canonical, throughout an extension-in-phase. In a less degree it is true of 
the microcanonical ensemble, which does not occupy any extension-in-phase, 
(in the sense in which we have used the term,) although it is convenient to 
regard it as a limiting case with respect to ensembles which do, as we thus 
gain for the subject some part of the analytical simplicity which belongs to 
the theory of ensembles which occupy true extensions-in-phase. 

In the same way it may be proved that the systems in a 
canonical ensemble which at a given instant are contained 
within any finite extension-in-phase will in general return to 
that extension-in-phase, if they leave it, the exceptions, i. e., 
the number which pass out of the extension-in-phase and do 
not return to it, being less than any assignable fraction of the 
whole number. In other words, the probability that a system 
taken at random from the part of a canonical ensemble which 
is contained within any given extension-in-phase, will pass out 
of that extension and not return to it, is zero. 

A similar theorem may be enunciated with respect to a 
microcanonical ensemble. Let us consider the fractional part 
of such an ensemble which lies within any given limits of 
phase. This fraction we shall denote by $F$. It is evidently 
constant in time since the ensemble is in statistical 
equilibrium. The systems within the limits will not in general 
remain the same, but some will pass out in each unit of time 
while an equal number come in. Some may pass out never 
to return within the limits. But the number which in any 
time however long pass out of the limits never to return will 
not bear any finite ratio to the number within the limits at 
a given instant. For, if it were otherwise, let $f$ denote the 
fraction representing such ratio for the time $T$. Then, in 
the time $T$, the number which pass out never to return will 
bear the ratio $fF$ to the whole number in the ensemble, and 
in a time exceeding $T/(fF)$ the number which pass out of 
the limits never to return would exceed the total number 
of systems in the ensemble. The proposition is therefore 
proved. 
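
The counting on which this argument turns may be set out explicitly; the integer $k$ is a symbol introduced here for the illustration. Since $F$ is constant, in each successive interval $T$ a fraction $fF$ of the whole ensemble leaves the limits for the last time, and a system can be so counted in only one interval. After $k$ intervals the fraction of the ensemble which has left never to return would therefore be

$$k\,fF \le 1,$$

which fails as soon as $k > 1/(fF)$, that is, for any time $kT$ exceeding $T/(fF)$. No finite value of $f$ is therefore admissible.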

This proof will apply to the cases before considered, and 
may be regarded as more simple than that which was given. 
It may also be applied to any true case of statistical 
equilibrium. By a true case of statistical equilibrium is meant such 
as may be described by giving the general value of the 
probability that an unspecified system of the ensemble is 
contained within any given limits of phase.* 

* An ensemble in which the systems are material points constrained to 
move in vertical circles, with just enough energy to carry them to the 
highest points, cannot afford a true example of statistical equilibrium. For 
any other value of the energy than the critical value mentioned, we might 
in various ways describe an ensemble in statistical equilibrium, while the 
same language applied to the critical value of the energy would fail to do 
so. Thus, if we should say that the ensemble is so distributed that the 
probability that a system is in any given part of the circle is proportioned 
to the time which a single system spends in that part, motion in either 
direction being equally probable, we should perfectly define a distribution in 
statistical equilibrium for any value of the energy except the critical value 
mentioned above, but for this value of the energy all the probabilities in 
question would vanish unless the highest point is included in the part of the 
circle considered, in which case the probability is unity, or forms one of its 
limits, in which case the probability is indeterminate. Compare the foot-note 
on page 118. 

A still more simple example is afforded by the uniform motion of a 
material point in a straight line. Here the impossibility of statistical 
equilibrium is not limited to any particular energy, and the canonical 
distribution as well as the microcanonical is impossible. 

These examples are mentioned here in order to show the necessity of 
caution in the application of the above principle, with respect to the question 
whether we have to do with a true case of statistical equilibrium. 

Another point in respect to which caution must be exercised is that the 
part of an ensemble of which the theorem of the return of systems is asserted 
should be entirely defined by limits within which it is contained, and not by 
any such condition as that a certain function of phase shall have a given 
value. This is necessary in order that the part of the ensemble which is 
considered should be any assignable fraction of the whole. Thus, if we have 
a canonical ensemble consisting of material points in vertical circles, the 
theorem of the return of systems may be applied to a part of the ensemble 
defined as contained in a given part of the circle. But it may not be applied 
in all cases to a part of the ensemble defined as contained in a given part 
of the circle and having a given energy. It would, in fact, express the exact 
opposite of the truth when the given energy is the critical value mentioned 
above. 

Let us next consider whether an ensemble of isolated 
systems has any tendency in the course of time toward a 
state of statistical equilibrium. 

There are certain functions of phase which are constant in 
time. The distribution of the ensemble with respect to the 
values of these functions is necessarily invariable, that is, 
the number of systems within any limits which can be 
specified in terms of these functions cannot vary in the course 
of time. The distribution in phase which without violating 
this condition gives the least value of the average index of 
probability of phase ($\overline{\eta}$) is unique, and is that in which the 
index of probability ($\eta$) is a function of the functions 
mentioned.* It is therefore a permanent distribution,† and the 
only permanent distribution consistent with the invariability 
of the distribution with respect to the functions of phase 
which are constant in time. 

It would seem, therefore, that we might find a sort of 
measure of the deviation of an ensemble from statistical equilibrium 
in the excess of the average index above the minimum which is 
consistent with the condition of the invariability of the 
distribution with respect to the constant functions of phase. But 
we have seen that the index of probability is constant in time 
for each system of the ensemble. The average index is 
therefore constant, and we find by this method no approach toward 
statistical equilibrium in the course of time. 

Yet we must here exercise great caution. One function 
may approach indefinitely near to another function, while 
some quantity determined by the first does not approach the 
corresponding quantity determined by the second. A line 
joining two points may approach indefinitely near to the 
straight line joining them, while its length remains constant. 
We may find a closer analogy with the case under 
consideration in the effect of stirring an incompressible liquid.‡ In 
space of $2n$ dimensions the case might be made analytically 
identical with that of an ensemble of systems of $n$ 
degrees of freedom, but the analogy is perfect in ordinary 
space. Let us suppose the liquid to contain a certain amount 
of coloring matter which does not affect its hydrodynamic 
properties. Now the state in which the density of the coloring 
matter is uniform, i. e., the state of perfect mixture, which is 
a sort of state of equilibrium in this respect that the 
distribution of the coloring matter in space is not affected by the 
internal motions of the liquid, is characterized by a minimum 
value of the average square of the density of the coloring 
matter. Let us suppose, however, that the coloring matter is 
distributed with a variable density. If we give the liquid any 
motion whatever, subject only to the hydrodynamic law of 
incompressibility (it may be a steady flux, or it may vary 
with the time), the density of the coloring matter at any 
same point of the liquid will be unchanged, and the average 
square of this density will therefore be unchanged. Yet no 
fact is more familiar to us than that stirring tends to bring a 
liquid to a state of uniform mixture, or uniform densities of 
its components, which is characterized by minimum values 
of the average squares of these densities. It is quite true that 
in the physical experiment the result is hastened by the 
process of diffusion, but the result is evidently not dependent 
on that process. 

* See Chapter XI, Theorem IV. 

† See Chapter IV, sub init. 

‡ By liquid is here meant the continuous body of theoretical 
hydrodynamics, and not anything of the molecular structure and molecular motions 
of real liquids. 

The contradiction is to be traced to the notion of the density 
of the coloring matter, and the process by which this quantity 
is evaluated. This quantity is the limiting ratio of the 
quantity of the coloring matter in an element of space to the 
volume of that element. Now if we should take for our 
elements of volume, after any amount of stirring, the spaces 
occupied by the same portions of the liquid which originally 
occupied any given system of elements of volume, the 
densities of the coloring matter, thus estimated, would be identical 
with the original densities as determined by the given system 
of elements of volume. Moreover, if at the end of any finite 
amount of stirring we should take our elements of volume in 
any ordinary form but sufficiently small, the average square 
of the density of the coloring matter, as determined by such 
element of volume, would approximate to any required degree 
to its value before the stirring. But if we take any element 
of space of fixed position and dimensions, we may continue 
the stirring so long that the densities of the colored liquid 
estimated for these fixed elements will approach a uniform 
limit, viz., that of perfect mixture. 

The case is evidently one of those in which the limit of a 
limit has different values, according to the order in which we 
apply the processes of taking a limit. If treating the elements 
of volume as constant, we continue the stirring indefinitely, 
we get a uniform density, a result not affected by making the 
elements as small as we choose; but if treating the amount of 
stirring as finite, we diminish indefinitely the elements of 
volume, we get exactly the same distribution in density as 
before the stirring, a result which is not affected by 
continuing the stirring as long as we choose. The question is 
largely one of language and definition. One may perhaps be 
allowed to say that a finite amount of stirring will not affect 
the mean square of the density of the coloring matter, but an 
infinite amount of stirring may be regarded as producing a 
condition in which the mean square of the density has its 
minimum value, and the density is uniform. We may 
certainly say that a sensibly uniform density of the colored 
component may be produced by stirring. Whether the time 
required for this result would be long or short depends upon 
the nature of the motion given to the liquid, and the fineness 
of our method of evaluating the density. 

All this may appear more distinctly if we consider a special 
case of liquid motion. Let us imagine a cylindrical mass of 
liquid of which one sector of 90° is black and the rest white. 
Let it have a motion of rotation about the axis of the cylinder 
in which the angular velocity is a function of the distance 
from the axis. In the course of time the black and the white 
parts would become drawn out into thin ribbons, which would 
be wound spirally about the axis. The thickness of these 
ribbons would diminish without limit, and the liquid would 
therefore tend toward a state of perfect mixture of the black and 
white portions. That is, in any given element of space, the 
proportion of the black and white would approach 1 : 3 as a limit. 
Yet after any finite time, the total volume would be divided 
into two parts, one of which would consist of the white liquid 
exclusively, and the other of the black exclusively. If the 
coloring matter, instead of being distributed initially with a 
uniform density throughout a section of the cylinder, were 
distributed with a density represented by any arbitrary 
function of the cylindrical coordinates $r$, $\theta$ and $z$, the effect of the 
same motion continued indefinitely would be an approach to 
a condition in which the density is a function of $r$ and $z$ alone. 
In this limiting condition, the average square of the density 
would be less than in the original condition, when the density 
was supposed to vary with $\theta$, although after any finite time 
the average square of the density would be the same as at 
first. 
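
The special case just described lends itself to a small numerical experiment, given here as a sketch under assumptions and not as part of the original discussion. The rotation law (angular velocity equal to the distance from the axis), the restriction to an annulus between radii 1 and 2, and the division of that annulus into 8 rings and 16 sectors of fixed position are all choices made for the example. The color at a fixed point is found by tracing the liquid back to its initial position, and the density of black is then estimated for each fixed cell.

```python
import math


def angular_velocity(r):
    """Assumed rotation law: angular velocity equal to the distance from the axis."""
    return r


def is_black(r, theta, t):
    """Color at the fixed point (r, theta) at time t, traced back to time 0."""
    initial_angle = (theta - angular_velocity(r) * t) % (2.0 * math.pi)
    return initial_angle < math.pi / 2.0  # the black sector of 90 degrees


def coarse_mean_square(t, rings=8, sectors=16, sub=40):
    """Average square of the black density estimated over fixed cells."""
    weighted_squares = 0.0
    total_weight = 0.0
    for i in range(rings):
        # Sample radii inside ring i of the annulus 1 <= r <= 2.
        radii = [1.0 + (i + (a + 0.5) / sub) / rings for a in range(sub)]
        for j in range(sectors):
            angles = [2.0 * math.pi * (j + (b + 0.5) / sub) / sectors
                      for b in range(sub)]
            black = 0.0
            weight = 0.0
            for r in radii:
                for theta in angles:
                    weight += r  # the element of area is proportional to r
                    if is_black(r, theta, t):
                        black += r
            density = black / weight
            weighted_squares += weight * density ** 2
            total_weight += weight
    return weighted_squares / total_weight


before = coarse_mean_square(0.0)
after = coarse_mean_square(100.0)
```

With the cells fixed, `before` should be very nearly 1/4, and `after` should have fallen toward 1/16, the value belonging to a uniform density of one quarter. If the cells were instead made finer without limit at a fixed time, the estimate would return to its initial value, which is the distinction of limits drawn in the text.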

If we limit our attention to the motion in a single plane 
perpendicular to the axis of the cylinder, we have something 
which is almost identical with a diagrammatic representation 
of the changes in distribution in phase of an ensemble of 
systems of one degree of freedom, in which the motion is 
periodic, the period varying with the energy, as in the case of 
a pendulum swinging in a circular arc. If the coordinates 
and momenta of the systems are represented by 
rectangular coordinates in the diagram, the points in the diagram 
representing the changing phases of moving systems, will 
move about the origin in closed curves of constant energy. 
The motion will be such that areas bounded by points 
representing moving systems will be preserved. The only 
difference between the motion of the liquid and the motion in the 
diagram is that in one case the paths are circular, and in the 
other they differ more or less from that form. 

When the energy is proportional to $p^2 + q^2$ the curves of
constant energy are circles, and the period is independent of
the energy. There is then no tendency toward a state of
statistical equilibrium. The diagram turns about the origin
without change of form. This corresponds to the case of liquid
motion, when the liquid revolves with a uniform angular
velocity like a rigid solid.

The analogy between the motion of an ensemble of systems
in an extension-in-phase and a steady current in an
incompressible liquid, and the diagrammatic representation of the case
of one degree of freedom, which appeals to our geometrical
intuitions, may be sufficient to show how the conservation of
density in phase, which involves the conservation of the
average value of the index of probability of phase, is consistent
with an approach to a limiting condition in which that
average value is less. We might perhaps fairly infer from
such considerations as have been adduced that an approach
to a limiting condition of statistical equilibrium is the general
rule, when the initial condition is not of that character. But
the subject is of such importance that it seems desirable to
give it farther consideration.

Let us suppose the total extension-in-phase for the
kind of system considered to be divided into equal elements
($DV$) which are very small but not infinitely small. Let us
imagine an ensemble of systems distributed in this extension
in a manner represented by the index of probability $\eta$, which
is an arbitrary function of the phase subject only to the
restriction expressed by equation (46) of Chapter I. We shall
suppose the elements $DV$ to be so small that $\eta$ may in
general be regarded as sensibly constant within any one of them
at the initial moment. Let the path of a system be defined as
the series of phases through which it passes.

At the initial moment ($t'$) a certain system is in an element
of extension $DV'$. Subsequently, at the time $t''$, the same
system is in the element $DV''$. Other systems which were
at first in $DV'$ will at the time $t''$ be in $DV''$, but not all,
probably. The systems which were at first in $DV'$ will at
the time $t''$ occupy an extension-in-phase exactly as large as at
first. But it will probably be distributed among a very great
number of the elements ($DV$) into which we have divided
the total extension-in-phase. If it is not so, we can generally
take a later time at which it will be so. There will be
exceptions to this for particular laws of motion, but we will
confine ourselves to what may fairly be called the general case.
Only a very small part of the systems initially in $DV'$ will
be found in $DV''$ at the time $t''$, and those which are found in
$DV''$ at that time were at the initial moment distributed
among a very large number of elements $DV$.

What is important for our purpose is the value of $\eta$, the
index of probability of phase in the element $DV''$ at the time
$t''$. In the part of $DV''$ occupied by systems which at the
time $t'$ were in $DV'$ the value of $\eta$ will be the same as its
value in $DV'$ at the time $t'$, which we shall call $\eta'$. In the
parts of $DV''$ occupied by systems which at $t'$ were in
elements very near to $DV'$ we may suppose the value of $\eta$ to
vary little from $\eta'$. We cannot assume this in regard to parts
of $DV''$ occupied by systems which at $t'$ were in elements
remote from $DV'$. We want, therefore, some idea of the
nature of the extension-in-phase occupied at $t'$ by the
systems which at $t''$ will occupy $DV''$. Analytically, the
problem is identical with finding the extension occupied at $t''$
by the systems which at $t'$ occupied $DV'$. Now the systems
in $DV''$ which lie on the same path as the system first
considered evidently arrived at $DV''$ at nearly the same time,
and must have left $DV'$ at nearly the same time, and
therefore at $t'$ were in or near $DV'$. We may therefore take $\eta'$ as
the value for these systems. The same essentially is true of
systems in $DV''$ which lie on paths very close to the path
already considered. But with respect to paths passing through
$DV'$ and $DV''$, but not so close to the first path, we cannot
assume that the time required to pass from $DV'$ to $DV''$ is
nearly the same as for the first path. The difference of the
times required may be small in comparison with $t'' - t'$, but as
this interval can be as large as we choose, the difference of the
times required in the different paths has no limit to its
possible value. Now if the case were one of statistical
equilibrium, the value of $\eta$ would be constant in any path, and if all
the paths which pass through $DV''$ also pass through or near
$DV'$, the value of $\eta$ throughout $DV''$ will vary little from
$\eta'$. But when the case is not one of statistical equilibrium,
we cannot draw any such conclusion. The only conclusion
which we can draw with respect to the phase at $t'$ of the
systems which at $t''$ are in $DV''$ is that they are nearly on the
same path.

Now if we should make a new estimate of indices of
probability of phase at the time $t''$, using for this purpose the
elements $DV$, that is, if we should divide the number of
systems in $DV''$, for example, by the total number of systems,
and also by the extension-in-phase of the element, and take
the logarithm of the quotient, we would get a number which
would be less than the average value of $\eta$ for the systems
within $DV''$ based on the distribution in phase at the time $t'$.*
Hence the average value of $\eta$ for the whole ensemble of
systems based on the distribution at $t''$ will be less than the
average value based on the distribution at $t'$.
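
The inequality behind this step can be illustrated with a small calculation. The sketch below is an illustration under stated assumptions, not Gibbs's own construction: one element of extension is imagined as cut into sub-cells of equal extension, the list `densities` holds hypothetical values of the probability-coefficient in those sub-cells, and both function names are invented for the example.

```python
import math

def fine_average_index(densities):
    """Average of eta = log(density) over the systems in the element.

    Each sub-cell has the same extension, so the number of systems in a
    sub-cell is proportional to its density.
    """
    total = sum(densities)
    return sum(d * math.log(d) for d in densities if d > 0.0) / total

def coarse_index(densities):
    """Index re-estimated from the element as a whole: log of the mean density."""
    return math.log(sum(densities) / len(densities))

if __name__ == "__main__":
    mixed = [0.1, 0.4, 2.0, 1.5]      # hypothetical, strongly non-uniform
    uniform = [0.7, 0.7, 0.7, 0.7]    # hypothetical, already uniform
    for name, cell in (("mixed", mixed), ("uniform", uniform)):
        print(name, coarse_index(cell), fine_average_index(cell))
```

For the non-uniform cell the re-estimated index falls below the average of the original indices. For the uniform cell the two agree, which corresponds to the exceptional case in which nothing has been mixed.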

We must not forget that there are exceptions to this general
rule. These exceptions are in cases in which the laws
of motion are such that systems having small differences
of phase will continue always to have small differences of
phase.

It is to be observed that if the average index of probability in
an ensemble may be said in some sense to have a less value at
one time than at another, it is not necessarily priority in time
which determines the greater average index. If a distribution,
which is not one of statistical equilibrium, should be given
for a time $t'$, and the distribution at an earlier time $t''$ should
be defined as that given by the corresponding phases, if we
increase the interval leaving $t'$ fixed and taking $t''$ at an earlier
and earlier date, the distribution at $t''$ will in general approach
a limiting distribution which is in statistical equilibrium. The
determining difference in such cases is that between a definite
distribution at a definite time and the limit of a varying
distribution when the moment considered is carried either forward
or backward indefinitely.†

But while the distinction of prior and subsequent events
may be immaterial with respect to mathematical fictions, it is
quite otherwise with respect to the events of the real world.
It should not be forgotten, when our ensembles are chosen to
illustrate the probabilities of events in the real world, that
while the probabilities of subsequent events may often be
determined from the probabilities of prior events, it is rarely
the case that probabilities of prior events can be determined
from those of subsequent events, for we are rarely justified in
excluding the consideration of the antecedent probability of
the prior events.

* See Chapter XI, Theorem IX.

† One may compare the kinematical truism that when two points are
moving with uniform velocities, (with the single exception of the case where
the relative motion is zero,) their mutual distance at any definite time is less
than for $t = \infty$, or $t = -\infty$.

It is worthy of notice that to take a system at random from
an ensemble at a date chosen at random from several given
dates, $t'$, $t''$, etc., is practically the same thing as to take a
system at random from the ensemble composed of all the systems
of the given ensemble in their phases at the time $t'$, together
with the same systems in their phases at the time $t''$, etc. By
Theorem VIII of Chapter XI this will give an ensemble in
which the average index of probability will be less than in
the given ensemble, except in the case when the distribution
in the given ensemble is the same at the times $t'$, $t''$, etc.
Consequently, any indefiniteness in the time in which we take
a system at random from an ensemble has the practical effect
of diminishing the average index of the ensemble from which
the system may be supposed to be drawn, except when the
given ensemble is in statistical equilibrium.



CHAPTER XIII.


EFFECT OF VARIOUS PROCESSES ON AN ENSEMBLE OF SYSTEMS.

In the last chapter and in Chapter I we have considered the 
changes which take place in the course of time in an ensemble 
of isolated systems. Let us now proceed to consider the 
changes which will take place in an ensemble of systems under 
external influences. These external influences will be of two 
kinds, the variation of the coordinates which we have called 
external, and the action of other ensembles of systems. The
essential difference of the two kinds of influence consists in
this, that the bodies to which the external coordinates relate
are not distributed in phase, while in the case of interaction
of the systems of two ensembles, we have to regard the fact
that both are distributed in phase. To find the effect
produced on the ensemble with which we are principally
concerned, we have therefore to consider single values of what
we have called external coordinates, but an infinity of values
of the internal coordinates of any other ensemble with which
there is interaction.

Or, — to regard the subject from another point of view, — 
the action between an unspecified system of an ensemble and 
the bodies represented by the external coordinates, is the 
action between a system imperfectly determined with respect
to phase and one which is perfectly determined; while the 
interaction between two unspecified systems belonging to 
different ensembles is the action between two systems both of 
which are imperfectly determined with respect to phase.* 

We shall suppose the ensembles which we consider to be
distributed in phase in the manner described in Chapter I, and
represented by the notations of that chapter, especially by the
index of probability of phase ($\eta$). There are therefore $2n$
independent variations in the phases which constitute the
ensembles considered. This excludes ensembles like the
microcanonical, in which, as energy is constant, there are
only $2n - 1$ independent variations of phase. This seems
necessary for the purposes of a general discussion. For
although we may imagine a microcanonical ensemble to have
a permanent existence when isolated from external influences,
the effect of such influences would generally be to destroy the
uniformity of energy in the ensemble. Moreover, since the
microcanonical ensemble may be regarded as a limiting case of
such ensembles as are described in Chapter I, (and that in
more than one way, as shown in Chapter X,) the exclusion is
rather formal than real, since any properties which belong to
the microcanonical ensemble could easily be derived from those
of the ensembles of Chapter I, which in a certain sense may
be regarded as representing the general case.

* In the development of the subject, we shall find that this distinction
corresponds to the distinction in thermodynamics between mechanical and
thermal action.

Let us first consider the effect of variation of the external 
coordinates. We have already had occasion to regard these 
quantities as variable in the differentiation of certain equations 
relating to ensembles distributed according to certain laws 
called canonical or microcanonical. That variation of the 
external coordinates was, however, only carrying the attention
of the mind from an ensemble with certain values of the
external coordinates, and distributed in phase according to
some general law depending upon those values, to another
ensemble with different values of the external coordinates, and
with the distribution changed to conform to these new values.

What we have now to consider is the effect which would 
actually result in the course of time in an ensemble of systems 
in which the external coordinates should be varied in any 
arbitrary manner. Let us suppose, in the first place, that 
these coordinates are varied abruptly at a given instant, being
constant both before and after that instant. By the definition
of the external coordinates it appears that this variation does
not affect the phase of any system of the ensemble at the time
when it takes place. Therefore it does not affect the index of
probability of phase ($\eta$) of any system, or the average value
of the index ($\bar\eta$) at that time. And if these quantities are
constant in time before the variation of the external
coordinates, and after that variation, their constancy in time is not
interrupted by that variation. In fact, in the demonstration
of the conservation of probability of phase in Chapter I, the
variation of the external coordinates was not excluded.

But a variation of the external coordinates will in general 
disturb a previously existing state of statistical equilibrium. 
For, although it does not affect (at the first instant) the 
distribution-in-phase, it does affect the condition necessary for 
equilibrium. This condition, as we have seen in Chapter IV,
is that the index of probability of phase shall be a function of
phase which is constant in time for moving systems. Now
a change in the external coordinates, by changing the forces
which act on the systems, will change the nature of the
functions of phase which are constant in time. Therefore,
the distribution in phase which was one of statistical
equilibrium for the old values of the external coordinates, will not
be such for the new values.

Now we have seen, in the last chapter, that when the
distribution-in-phase is not one of statistical equilibrium, an
ensemble of systems may, and in general will, after a longer or
shorter time, come to a state which may be regarded, if very
small differences of phase are neglected, as one of statistical
equilibrium, and in which consequently the average value of
the index ($\bar\eta$) is less than at first. It is evident, therefore,
that a variation of the external coordinates, by disturbing a
state of statistical equilibrium, may indirectly cause a
diminution, (in a certain sense at least,) of the value of $\bar\eta$.

But if the change in the external coordinates is very small,
the change in the distribution necessary for equilibrium will
in general be correspondingly small. Hence, the original
distribution in phase, since it differs little from one which would
be in statistical equilibrium with the new values of the
external coordinates, may be supposed to have a value of $\bar\eta$
which differs by a small quantity of the second order from
the minimum value which characterizes the state of statistical
equilibrium. And the diminution in the average index
resulting in the course of time from the very small change in the
external coordinates, cannot exceed this small quantity of
the second order.
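
The second-order behavior can be checked on a toy model. The sketch below is only an illustration under assumptions that are not in the text: a system with three discrete states replaces the continuous phase, the ensemble is canonical with modulus `theta`, and a small change `delta` of a hypothetical external coordinate shifts the energy levels by `delta * g`. Because the canonical distribution gives the least value of the average index plus the average energy divided by the modulus, the excess of that quantity for the old distribution over its minimum for the new levels measures how far the old distribution is from the new equilibrium.

```python
import math

def canonical(levels, theta):
    """Canonical probabilities for discrete energy levels and modulus theta."""
    weights = [math.exp(-e / theta) for e in levels]
    z = sum(weights)
    return [w / z for w in weights]

def index_plus_energy(probs, levels, theta):
    """Average index plus average energy over theta, for a discrete ensemble."""
    return sum(p * (math.log(p) + e / theta) for p, e in zip(probs, levels))

def excess(delta, theta=1.0):
    """Excess over the new minimum when the old canonical distribution is kept."""
    old_levels = [0.0, 1.0, 2.0]          # hypothetical energy levels
    g = [0.0, 1.0, 3.0]                   # hypothetical response to the coordinate
    new_levels = [e + delta * gi for e, gi in zip(old_levels, g)]
    old = canonical(old_levels, theta)
    new = canonical(new_levels, theta)
    return (index_plus_energy(old, new_levels, theta)
            - index_plus_energy(new, new_levels, theta))

if __name__ == "__main__":
    for delta in (0.04, 0.02, 0.01):
        print(delta, excess(delta))
```

Halving `delta` reduces the excess by a factor close to four, as a quantity of the second order should behave.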

Hence also, if the change in the external coordinates of an
ensemble initially in statistical equilibrium consists in
successive very small changes separated by very long intervals of
time in which the disturbance of statistical equilibrium
becomes sensibly effaced, the final diminution in the average
index of probability will in general be negligible, although the
total change in the external coordinates is large. The result
will be the same if the change in the external coordinates
takes place continuously but sufficiently slowly.

Even in cases in which there is no tendency toward the
restoration of statistical equilibrium in the lapse of time, a
variation of external coordinates which would cause, if it took
place in a short time, a great disturbance of a previous state
of equilibrium, may, if sufficiently distributed in time, produce
no sensible disturbance of the statistical equilibrium.

Thus, in the case of three degrees of freedom, let the systems
be heavy points suspended by elastic massless cords, and let the
ensemble be distributed in phase with a density proportioned
to some function of the energy, and therefore in statistical
equilibrium. For a change in the external coordinates, we may
take a horizontal motion of the point of suspension. If this
is moved a given distance, the resulting disturbance of the
statistical equilibrium may evidently be diminished
indefinitely by diminishing the velocity of the point of suspension.
This will be true if the law of elasticity of the string is such
that the period of vibration is independent of the energy, in
which case there is no tendency in the course of time toward
a state of statistical equilibrium, as well as in the more general
case, in which there is a tendency toward statistical equilibrium.

That something of this kind will be true in general, the
following considerations will tend to show.

We define a path as the series of phases through which a
system passes in the course of time when the external
coordinates have fixed values. When the external coordinates
are varied, paths are changed. The path of a phase is the
path to which that phase belongs. With reference to any
ensemble of systems we shall denote by $\overline{D}|_p$ the average value
of the density-in-phase in a path. This implies that we have
a measure for comparing different portions of the path. We
shall suppose the time required to traverse any portion of a
path to be its measure for the purpose of determining this
average.

With this understanding, let us suppose that a certain
ensemble is in statistical equilibrium. In every element of
extension-in-phase, therefore, the density-in-phase $D$ is equal
to its path-average $\overline{D}|_p$. Let a sudden small change be made
in the external coordinates. The statistical equilibrium will be
disturbed and we shall no longer have $D = \overline{D}|_p$ everywhere.
This is not because $D$ is changed, but because $\overline{D}|_p$ is changed,
the paths being changed. It is evident that if $D > \overline{D}|_p$ in
a part of a path, we shall have $D < \overline{D}|_p$ in other parts of the
same path.

Now, if we should imagine a further change in the external
coordinates of the same kind, we should expect it to produce
an effect of the same kind. But the manner in which the
second effect will be superposed on the first will be different,
according as it occurs immediately after the first change or
after an interval of time. If it occurs immediately after the
first change, then in any element of phase in which the first
change produced a positive value of $D - \overline{D}|_p$ the second change
will add a positive value to the first positive value, and where
$D - \overline{D}|_p$ was negative, the second change will add a negative
value to the first negative value.

But if we wait a sufficient time before making the second
change in the external coordinates, so that systems have
passed from elements of phase in which $D - \overline{D}|_p$ was
originally positive to elements in which it was originally negative,
and vice versa, (the systems carrying with them the values
of $D - \overline{D}|_p$,) the positive values of $D - \overline{D}|_p$ caused by the
second change will be in part superposed on negative values
due to the first change, and vice versa.

The disturbance of statistical equilibrium, therefore,
produced by a given change in the values of the external
coordinates may be very much diminished by dividing the
change into two parts separated by a sufficient interval of
time, and a sufficient interval of time for this purpose is one
in which the phases of the individual systems are entirely
unlike the first, so that any individual system is differently
affected by the change, although the whole ensemble is
affected in nearly the same way. Since there is no limit to the
diminution of the disturbance of equilibrium by division of
the change in the external coordinates, we may suppose as
a general rule that by diminishing the velocity of the changes
in the external coordinates, a given change may be made to
produce a very small disturbance of statistical equilibrium.
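
For the case in which the period is independent of the energy, the effect of dividing the change can be worked out exactly. The sketch below is an illustration under assumptions of its own: one degree of freedom, unit mass and unit frequency, and a centre of force whose position plays the part of the external coordinate. In this case the whole distribution turns rigidly about the centre, so the distance of its centroid from the centre is taken here as a measure of the disturbance. The function names are hypothetical.

```python
import math

def evolve(q, p, center, t):
    """Exact motion for time t about `center` (unit mass, unit frequency assumed)."""
    c, s = math.cos(t), math.sin(t)
    dq = q - center
    return center + dq * c + p * s, -dq * s + p * c

def disturbance(steps):
    """Distance in phase between the ensemble centroid and the final centre.

    `steps` is a list of (waiting_time, displacement) pairs: the ensemble
    moves freely for waiting_time, then the centre of force jumps by
    displacement.
    """
    q, p, center = 0.0, 0.0, 0.0      # equilibrium: centroid at the centre
    for wait, jump in steps:
        q, p = evolve(q, p, center, wait)
        center += jump
    return math.hypot(q - center, p)

if __name__ == "__main__":
    print(disturbance([(0.0, 2.0)]))                      # one abrupt change
    print(disturbance([(0.0, 1.0), (math.pi, 1.0)]))      # two halves, half a period apart
    print(disturbance([(0.37, 0.02)] * 100))              # many small steps
```

The same total displacement leaves a large disturbance when made at once, practically none when made in two halves separated by half a period, and only a small one when spread over many small steps.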

If we write $\bar\eta'$ for the value of the average index of probability
before the variation of the external coordinates, and $\bar\eta''$ for the
value after this variation, we shall have in any case

$\bar\eta'' \leq \bar\eta'$

as the simple result of the variation of the external
coordinates. This may be compared with the thermodynamic
theorem that the entropy of a body cannot be diminished by
mechanical (as distinguished from thermal) action.*

If we have (approximate) statistical equilibrium between
the times $t'$ and $t''$ (corresponding to $\bar\eta'$ and $\bar\eta''$), we shall have
approximately

$\bar\eta' = \bar\eta''$,

which may be compared with the thermodynamic theorem that
the entropy of a body is not (sensibly) affected by mechanical
action, during which the body is at each instant (sensibly) in
a state of thermodynamic equilibrium.

Approximate statistical equilibrium may usually be attained
by a sufficiently slow variation of the external coordinates,
just as approximate thermodynamic equilibrium may usually
be attained by sufficient slowness in the mechanical operations
to which the body is subject.

* The correspondences to which the reader's attention is called are between
$-\bar\eta$ and entropy, and between $\Theta$ and temperature.

We now pass to the consideration of the effect on an
ensemble of systems which is produced by the action of other
ensembles with which it is brought into dynamical
connection. In a previous chapter* we have imagined a dynamical
connection arbitrarily created between the systems of two
ensembles. We shall now regard the action between the
systems of the two ensembles as a result of the variation
of the external coordinates, which causes such variations
of the internal coordinates as to bring the systems of the
two ensembles within the range of each other’s action.

Initially, we suppose that we have two separate ensembles
of systems, $E_1$ and $E_2$. The numbers of degrees of freedom
of the systems in the two ensembles will be denoted by $n_1$ and
$n_2$ respectively, and the probability-coefficients by $e^{\eta_1}$ and
$e^{\eta_2}$. Now we may regard any system of the first ensemble
combined with any system of the second as forming a single
system of $n_1 + n_2$ degrees of freedom. Let us consider the
ensemble ($E_{12}$) obtained by thus combining each system of the
first ensemble with each of the second.

At the initial moment, which may be specified by a single 
accent, the probability-coefficient of any phase of the combined 
systems is evidently the product of the probability-coefficients 
of the phases of which it is made up. This may be expressed 
by the equation, 

$e^{\eta_{12}'} = e^{\eta_1'} e^{\eta_2'}$,    (455)

or    $\eta_{12}' = \eta_1' + \eta_2'$,    (456)

which gives    $\bar\eta_{12}' = \bar\eta_1' + \bar\eta_2'$.    (457)

The forces tending to vary the internal coordinates of the
combined systems, together with those exerted by either
system upon the bodies represented by the coordinates called
external, may be derived from a single force-function, which,
taken negatively, we shall call the potential energy of the
combined systems and denote by $\epsilon_{q12}$. But we suppose that
initially none of the systems of the two ensembles $E_1$ and
$E_2$ come within range of each other’s action, so that the
potential energy of the combined system falls into two parts
relating separately to the systems which are combined. The
same is obviously true of the kinetic energy of the combined
compound system, and therefore of its total energy. This
may be expressed by the equation

$\epsilon_{12}' = \epsilon_1' + \epsilon_2'$,    (458)

which gives    $\bar\epsilon_{12}' = \bar\epsilon_1' + \bar\epsilon_2'$.    (459)

* See Chapter IV, page 37.

Let us now suppose that in the course of time, owing to the
motion of the bodies represented by the coordinates called
external, the forces acting on the systems and consequently
their positions are so altered, that the systems of the ensembles
$E_1$ and $E_2$ are brought within range of each other’s action,
and after such mutual influence has lasted for a time, by a
further change in the external coordinates, perhaps a return
to their original values, the systems of the two original
ensembles are brought again out of range of each other’s action.
Finally, then, at a time specified by double accents, we shall
have as at first

$\bar\epsilon_{12}'' = \bar\epsilon_1'' + \bar\epsilon_2''$.    (460)

But for the indices of probability we must write *

$\bar\eta_1'' + \bar\eta_2'' \leq \bar\eta_{12}''$.    (461)

The considerations adduced in the last chapter show that it
is safe to write

$\bar\eta_{12}'' \leq \bar\eta_{12}'$.    (462)

We have therefore

$\bar\eta_1'' + \bar\eta_2'' \leq \bar\eta_1' + \bar\eta_2'$,    (463)

which may be compared with the thermodynamic theorem that
the thermal contact of two bodies may increase but cannot
diminish the sum of their entropies.

* See Chapter XI, Theorem VII.
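
The statement that the sum of the average indices of the separate ensembles cannot exceed the average index of the combined ensemble has a simple discrete analogue. The sketch below is an illustration under assumptions not found in the text: each system has only two states, the table `joint` gives hypothetical probabilities for the pairs of states, and the function names are invented for the example.

```python
import math

def average_index(probs):
    """Average of log(probability) over a discrete distribution."""
    return sum(p * math.log(p) for p in probs if p > 0.0)

def marginals(joint):
    """Row and column sums of a joint probability table."""
    first = [sum(row) for row in joint]
    second = [sum(col) for col in zip(*joint)]
    return first, second

def separate_and_combined(joint):
    """Sum of the two separate average indices, and the combined average index."""
    first, second = marginals(joint)
    separate = average_index(first) + average_index(second)
    combined = average_index([p for row in joint for p in row])
    return separate, combined

if __name__ == "__main__":
    correlated = [[0.4, 0.1], [0.1, 0.4]]       # hypothetical state after interaction
    independent = [[0.25, 0.25], [0.25, 0.25]]  # product of two uniform marginals
    print(separate_and_combined(correlated))
    print(separate_and_combined(independent))
```

When the two systems are correlated, the separate indices add up to less than the combined index. When they are independent, as at the initial moment, the two quantities agree.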

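Let us especially consider the case in which the two original
ensembles were both canonically distributed in phase with the
respective moduli $\Theta_1$ and $\Theta_2$. We have then, by Theorem III
of Chapter XI,

$\bar\eta_1' + \frac{\bar\epsilon_1'}{\Theta_1} \leq \bar\eta_1'' + \frac{\bar\epsilon_1''}{\Theta_1}$,    (464)

$\bar\eta_2' + \frac{\bar\epsilon_2'}{\Theta_2} \leq \bar\eta_2'' + \frac{\bar\epsilon_2''}{\Theta_2}$.    (465)

Whence with (463) we have

$\frac{\bar\epsilon_1'}{\Theta_1} + \frac{\bar\epsilon_2'}{\Theta_2} \leq \frac{\bar\epsilon_1''}{\Theta_1} + \frac{\bar\epsilon_2''}{\Theta_2}$,    (466)

or    $\frac{\bar\epsilon_1'' - \bar\epsilon_1'}{\Theta_1} + \frac{\bar\epsilon_2'' - \bar\epsilon_2'}{\Theta_2} \geq 0$.    (467)

If we write $W$ for the average work done by the combined
systems on the external bodies, we have by the principle of
the conservation of energy

$W = \bar\epsilon_{12}' - \bar\epsilon_{12}'' = \bar\epsilon_1' - \bar\epsilon_1'' + \bar\epsilon_2' - \bar\epsilon_2''$.    (468)

Now if $W$ is negligible, we have

$\bar\epsilon_1'' - \bar\epsilon_1' = -(\bar\epsilon_2'' - \bar\epsilon_2')$,    (469)

and (467) shows that the ensemble which has the greater
modulus must lose energy. This result may be compared to
the thermodynamic principle, that when two bodies of
different temperatures are brought together, that which has the
higher temperature will lose energy.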
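
The step from (467) to the statement about the greater modulus may be written out. The short derivation below only combines the inequality (467) with the condition of negligible work (469), both restated in its first line. It introduces no assumption beyond those of the text.

```latex
\begin{align*}
\frac{\bar\epsilon_1'' - \bar\epsilon_1'}{\Theta_1}
  + \frac{\bar\epsilon_2'' - \bar\epsilon_2'}{\Theta_2} &\geq 0,
  \qquad
  \bar\epsilon_2'' - \bar\epsilon_2' = -(\bar\epsilon_1'' - \bar\epsilon_1'), \\
\left(\bar\epsilon_1'' - \bar\epsilon_1'\right)
  \left(\frac{1}{\Theta_1} - \frac{1}{\Theta_2}\right) &\geq 0 .
\end{align*}
```

If $\Theta_1 > \Theta_2$, the second factor is negative, so that $\bar\epsilon_1'' - \bar\epsilon_1' \leq 0$: the ensemble of greater modulus cannot gain energy on the average.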

Let us next suppose that the ensemble $E_2$ is originally
canonically distributed with the modulus $\Theta_2$, but leave the
distribution of the other arbitrary. We have, to determine
the result of a similar process,

$\bar\eta_1'' + \bar\eta_2'' \leq \bar\eta_1' + \bar\eta_2'$,

$\bar\eta_2' + \frac{\bar\epsilon_2'}{\Theta_2} \leq \bar\eta_2'' + \frac{\bar\epsilon_2''}{\Theta_2}$.

Hence    $\bar\eta_1'' + \frac{\bar\epsilon_2'}{\Theta_2} \leq \bar\eta_1' + \frac{\bar\epsilon_2''}{\Theta_2}$,    (470)

which may be written

$\bar\eta_1' - \bar\eta_1'' \geq \frac{\bar\epsilon_2' - \bar\epsilon_2''}{\Theta_2}$.    (471)

This may be compared with the thermodynamic principle that
when a body (which need not be in thermal equilibrium) is
brought into thermal contact with another of a given
temperature, the increase of entropy of the first cannot be less
(algebraically) than the loss of heat by the second divided by its
temperature. Where $W$ is negligible, we may write

$\bar\eta_1'' + \frac{\bar\epsilon_1''}{\Theta_2} \leq \bar\eta_1' + \frac{\bar\epsilon_1'}{\Theta_2}$.    (472)

Now, by Theorem III of Chapter XI, the quantity

$\bar\eta_1 + \frac{\bar\epsilon_1}{\Theta_2}$    (473)

has a minimum value when the ensemble to which $\bar\eta_1$ and $\bar\epsilon_1$
relate is distributed canonically with the modulus $\Theta_2$. If the
ensemble had originally this distribution, the sign < in (472)
would be impossible. In fact, in this case, it would be easy to
show that the preceding formulae on which (472) is founded
would all have the sign =. But when the two ensembles are
not both originally distributed canonically with the same
modulus, the formulae indicate that the quantity (473) may
be diminished by bringing the ensemble to which $\bar\eta_1$ and $\bar\epsilon_1$
relate into connection with another which is canonically
distributed with modulus $\Theta_2$, and therefore, that by repeated
operations of this kind the ensemble of which the original
distribution was entirely arbitrary might be brought
approximately into a state of canonical distribution with the modulus
$\Theta_2$. We may compare this with the thermodynamic principle
that a body of which the original thermal state may be entirely
arbitrary, may be brought approximately into a state of
thermal equilibrium with any given temperature by repeated
connections with other bodies of that temperature.

Let us now suppose that we have a certain number of
ensembles, $E_0$, $E_1$, $E_2$, etc., distributed canonically with the
respective moduli $\Theta_0$, $\Theta_1$, $\Theta_2$, etc. By variation of the
external coordinates of the ensemble $E_0$, let it be brought into
connection with $E_1$, and then let the connection be broken.
Let it then be brought into connection with $E_2$, and then let
that connection be broken. Let this process be continued
with respect to the remaining ensembles. We do not make
the assumption, as in some cases before, that the work connected
with the variation of the external coordinates is a negligible
quantity. On the contrary, we wish especially to consider
the case in which it is large. In the final state of the
ensemble $E_0$, let us suppose that the external coordinates have been
brought back to their original values, and that the average
energy ($\bar\epsilon_0$) is the same as at first.

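In our usual notations, using one and two accents to
distinguish original and final values, we get by repeated
applications of the principle expressed in (463)

$\bar\eta_0' + \bar\eta_1' + \bar\eta_2' + \text{etc.} \geq \bar\eta_0'' + \bar\eta_1'' + \bar\eta_2'' + \text{etc.}$    (474)

But by Theorem III of Chapter XI,

$\bar\eta_0'' + \frac{\bar\epsilon_0''}{\Theta_0} \geq \bar\eta_0' + \frac{\bar\epsilon_0'}{\Theta_0}$,    (475)

$\bar\eta_1'' + \frac{\bar\epsilon_1''}{\Theta_1} \geq \bar\eta_1' + \frac{\bar\epsilon_1'}{\Theta_1}$,    (476)

$\bar\eta_2'' + \frac{\bar\epsilon_2''}{\Theta_2} \geq \bar\eta_2' + \frac{\bar\epsilon_2'}{\Theta_2}$,    (477)

etc.

Hence    $\frac{\bar\epsilon_0''}{\Theta_0} + \frac{\bar\epsilon_1''}{\Theta_1} + \frac{\bar\epsilon_2''}{\Theta_2} + \text{etc.} \geq \frac{\bar\epsilon_0'}{\Theta_0} + \frac{\bar\epsilon_1'}{\Theta_1} + \frac{\bar\epsilon_2'}{\Theta_2} + \text{etc.}$,    (478)

or, since $\bar\epsilon_0' = \bar\epsilon_0''$,

$0 \geq \frac{\bar\epsilon_1' - \bar\epsilon_1''}{\Theta_1} + \frac{\bar\epsilon_2' - \bar\epsilon_2''}{\Theta_2} + \text{etc.}$    (479)

If we write $W$ for the average work done on the bodies
represented by the external coordinates, we have

$\bar\epsilon_1' - \bar\epsilon_1'' + \bar\epsilon_2' - \bar\epsilon_2'' + \text{etc.} = W$.    (480)

If $E_0$, $E_1$, and $E_2$ are the only ensembles, we have

$W \leq \frac{\Theta_1 - \Theta_2}{\Theta_1} (\bar\epsilon_1' - \bar\epsilon_1'')$.    (481)

It will be observed that the relations expressed in the last
three formulae between $W$, $\bar\epsilon_1' - \bar\epsilon_1''$, $\bar\epsilon_2' - \bar\epsilon_2''$, etc., and
$\Theta_1$, $\Theta_2$, etc., are precisely those which hold in a Carnot’s cycle for
the work obtained, the energy lost by the several bodies which
serve as heaters or coolers, and their initial temperatures.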
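
For two ensembles besides $E_0$, the bound on the work follows from the inequality (479) and the energy balance (480) by a short elimination, written out here as a reading aid. The abbreviations $a = \bar\epsilon_1' - \bar\epsilon_1''$ and $b = \bar\epsilon_2' - \bar\epsilon_2''$ are introduced only for this sketch, and the moduli are taken to be positive.

```latex
\begin{align*}
0 &\geq \frac{a}{\Theta_1} + \frac{b}{\Theta_2}, \qquad W = a + b, \\
b &\leq -\frac{\Theta_2}{\Theta_1}\, a, \\
W &\leq a - \frac{\Theta_2}{\Theta_1}\, a = \frac{\Theta_1 - \Theta_2}{\Theta_1}\, a .
\end{align*}
```

Here $a$ is the average energy given up by the ensemble of modulus $\Theta_1$, and the factor multiplying it has the form of the efficiency of a Carnot’s cycle working between the temperatures corresponding to $\Theta_1$ and $\Theta_2$.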

It will not escape the reader’s notice, that while from one 
point of view the operations which are here described are quite 
beyond our powers of actual performance, on account of the 
impossibility of handling the immense number of systems 
which are involved, yet from another point of view the
operations described are the most simple and accurate means of
representing what actually takes place in our simplest
experiments in thermodynamics. The states of the bodies which
we handle are certainly not known to us exactly. What we
know about a body can generally be described most accurately
and most simply by saying that it is one taken at random
from a great number (ensemble) of bodies which are
completely described. If we bring it into connection with another
body concerning which we have a similar limited knowledge,
the state of the two bodies is properly described as that of a
pair of bodies taken from a great number (ensemble) of pairs
which are formed by combining each body of the first
ensemble with each of the second.

Again, when we bring one body into thermal contact with 
another, for example, in a Carnot’s cycle, when we bring a 
mass of fluid into thermal contact with some other body from
which we wish it to receive heat, we may do it by moving the
vessel containing the fluid. This motion is mathematically
expressed by the variation of the coordinates which determine
the position of the vessel. We allow ourselves for the
purposes of a theoretical discussion to suppose that the walls of
this vessel are incapable of absorbing heat from the fluid.
Yet while we exclude the kind of action which we call
thermal between the fluid and the containing vessel, we allow the
kind which we call work in the narrower sense, which takes
place when the volume of the fluid is changed by the motion
of a piston. This agrees with what we have supposed in
regard to the external coordinates, which we may vary in
any arbitrary manner, and are in this entirely unlike the
coordinates of the second ensemble with which we bring the
first into connection.

When heat passes in any thermodynamic experiment between
the fluid principally considered and some other body, it is
actually absorbed and given out by the walls of the vessel,
which will retain a varying quantity. This is, however, a
disturbing circumstance, which we suppose in some way made
negligible, and actually neglect in a theoretical discussion.
In our case, we suppose the walls incapable of absorbing
energy, except through the motion of the external coordinates,
but that they allow the systems which they contain to act
directly on one another. Properties of this kind are
mathematically expressed by supposing that in the vicinity of a
certain surface, the position of which is determined by certain
(external) coordinates, particles belonging to the system in
question experience a repulsion from the surface increasing so
rapidly with nearness to the surface that an infinite
expenditure of energy would be required to carry them through it.
It is evident that two systems might be separated by a surface
or surfaces exerting the proper forces, and yet approach each
other closely enough to exert mechanical action on each other.



CHAPTER XIV. 


DISCUSSION OF THERMODYNAMIC ANALOGIES. 

If we wish to find in rational mechanics an a priori
foundation for the principles of thermodynamics, we must seek
mechanical definitions of temperature and entropy. The
quantities thus defined must satisfy (under conditions and
with limitations which again must be specified in the language
of mechanics) the differential equation

$$d\epsilon = T\,d\eta - A_1\,da_1 - A_2\,da_2 - \text{etc.}, \tag{482}$$

where $\epsilon$, $T$, and $\eta$ denote the energy, temperature, and entropy
of the system considered, and $A_1\,da_1$, etc., the mechanical work
(in the narrower sense in which the term is used in
thermodynamics, i.e., with exclusion of thermal action) done upon
external bodies.

This implies that we are able to distinguish in mechanical
terms the thermal action of one system on another from that
which we call mechanical in the narrower sense, if not indeed
in every case in which the two may be combined, at least so as
to specify cases of thermal action and cases of mechanical
action.

Such a differential equation moreover implies a finite
equation between $\epsilon$, $\eta$, and $a_1$, $a_2$, etc., which may be regarded
as fundamental in regard to those properties of the system
which we call thermodynamic, or which may be called so from
analogy. This fundamental thermodynamic equation is
determined by the fundamental mechanical equation which
expresses the energy of the system as function of its
momenta and coordinates with those external coordinates
($a_1$, $a_2$, etc.) which appear in the differential expression of the work
done on external bodies. We have to show the mathematical
operations by which the fundamental thermodynamic equation,
which in general is an equation of few variables, is derived
from the fundamental mechanical equation, which in the case
of the bodies of nature is one of an enormous number of
variables.

We have also to enunciate in mechanical terms, and to
prove, what we call the tendency of heat to pass from a
system of higher temperature to one of lower, and to show that
this tendency vanishes with respect to systems of the same
temperature.

At least, we have to show by a priori reasoning that for
such systems as the material bodies which nature presents to
us, these relations hold with such approximation that they
are sensibly true for human faculties of observation. This
indeed is all that is really necessary to establish the science of
thermodynamics on an a priori basis. Yet we will naturally
desire to find the exact expression of those principles of which
the laws of thermodynamics are the approximate expression.
A very little study of the statistical properties of conservative
systems of a finite number of degrees of freedom is sufficient
to make it appear, more or less distinctly, that the general
laws of thermodynamics are the limit toward which the exact
laws of such systems approximate, when their number of
degrees of freedom is indefinitely increased. And the problem
of finding the exact relations, as distinguished from the
approximate, for systems of a great number of degrees of
freedom, is practically the same as that of finding the relations
which hold for any number of degrees of freedom, as
distinguished from those which have been established on an
empirical basis for systems of a great number of degrees of
freedom.

The enunciation and proof of these exact laws, for systems
of any finite number of degrees of freedom, has been a
principal object of the preceding discussion. But it should be
distinctly stated that, if the results obtained when the numbers
of degrees of freedom are enormous coincide sensibly with
the general laws of thermodynamics, however interesting and
significant this coincidence may be, we are still far from
having explained the phenomena of nature with respect to
these laws. For, as compared with the case of nature, the
systems which we have considered are of an ideal simplicity.
Although our only assumption is that we are considering
conservative systems of a finite number of degrees of freedom,
it would seem that this is assuming far too much, so far as the
bodies of nature are concerned. The phenomena of radiant
heat, which certainly should not be neglected in any complete
system of thermodynamics, and the electrical phenomena
associated with the combination of atoms, seem to show that
the hypothesis of systems of a finite number of degrees of
freedom is inadequate for the explanation of the properties of
bodies.

Nor do the results of such assumptions in every detail 
appear to agree with experience. We should expect, for 
example, that a diatomic gas, so far as it could be treated 
independently of the phenomena of radiation, or of any sort of 
electrical manifestations, would have six degrees of freedom 
for each molecule. But the behavior of such a gas seems to 
indicate not more than five. 

But although these difficulties, long recognized by
physicists,* seem to prevent, in the present state of science, any
satisfactory explanation of the phenomena of thermodynamics
as presented to us in nature, the ideal case of systems of a
finite number of degrees of freedom remains as a subject
which is certainly not devoid of a theoretical interest, and
which may serve to point the way to the solution of the far
more difficult problems presented to us by nature. And if
the study of the statistical properties of such systems gives
us an exact expression of laws which in the limiting case take
the form of the received laws of thermodynamics, its interest
is so much the greater.

* See Boltzmann, Sitzb. der Wiener Akad., Bd. LXIII., S. 418, (1871).

Now we have defined what we have called the modulus ($\Theta$)
of an ensemble of systems canonically distributed in phase,
and what we have called the index of probability ($\eta$) of any
phase in such an ensemble. It has been shown that between
the modulus ($\Theta$), the external coordinates ($a_1$, etc.), and the
average values in the ensemble of the energy ($\bar\epsilon$), the index
of probability ($\bar\eta$), and the external forces ($\bar A_1$, etc.) exerted
by the systems, the following differential equation will hold:

$$d\bar\epsilon = -\Theta\,d\bar\eta - \bar A_1\,da_1 - \bar A_2\,da_2 - \text{etc.} \tag{483}$$

This equation, if we neglect the sign of averages, is identical
in form with the thermodynamic equation (482), the modulus
($\Theta$) corresponding to temperature, and the index of
probability of phase with its sign reversed corresponding to entropy.*

We have also shown that the average square of the
anomalies of $\epsilon$, that is, of the deviations of the individual values from
the average, is in general of the same order of magnitude as
the reciprocal of the number of degrees of freedom, and
therefore to human observation the individual values are
indistinguishable from the average values when the number of degrees
of freedom is very great.† In this case also the anomalies of $\eta$
are practically insensible. The same is true of the anomalies of
the external forces ($A_1$, etc.), so far as these are the result of
the anomalies of energy, so that when these forces are sensibly
determined by the energy and the external coordinates, and
the number of degrees of freedom is very great, the anomalies
of these forces are insensible.
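
The order-of-magnitude statement about the anomalies of energy can be illustrated numerically. The sketch below assumes a special case that is not imposed in the passage itself: an energy that is a quadratic function of the p's and q's, for which the canonical weight of the energy is proportional to $\epsilon^{n-1} e^{-\epsilon/\Theta}$. The function name is hypothetical. Under that assumption the mean-square anomaly divided by the square of the average energy comes out as $1/n$, independent of $\Theta$.

```python
import math

def relative_energy_anomaly(n):
    """Mean-square anomaly of the energy divided by the squared average
    energy, for a canonical weight eps**(n-1) * exp(-eps/theta).

    Assumption (illustrative only): energy quadratic in the p's and q's.
    Moments of that weight are gamma functions, so theta cancels:
    <eps^k> = theta**k * Gamma(n + k) / Gamma(n).
    """
    # log of <eps^2> / theta^2
    log_second = math.lgamma(n + 2) - math.lgamma(n)
    # log of <eps>^2 / theta^2
    log_mean_sq = 2.0 * (math.lgamma(n + 1) - math.lgamma(n))
    # <eps^2>/<eps>^2 - 1 equals variance / mean^2
    return math.exp(log_second - log_mean_sq) - 1.0

for n in (1, 10, 1000):
    print(n, relative_energy_anomaly(n))
```

For a body with an enormous number of degrees of freedom the printed ratio is correspondingly minute, which is the sense in which individual values become indistinguishable from averages.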

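The mathematical operations by which the finite equation
between $\bar\epsilon$, $\bar\eta$, and $a_1$, etc., is deduced from that which gives
the energy ($\epsilon$) of a system in terms of the momenta ($p_1 \ldots p_n$)
and coordinates both internal ($q_1 \ldots q_n$) and external ($a_1$, etc.),
are indicated by the equation

$$e^{-\psi/\Theta} = \int \cdots \int_{\text{all phases}} e^{-\epsilon/\Theta}\, dq_1 \ldots dq_n\, dp_1 \ldots dp_n, \tag{484}$$

where

$$\psi = \Theta\bar\eta + \bar\epsilon.$$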
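
As a hedged worked instance of equation (484), consider a single harmonic oscillator, which is an illustrative choice and not a system discussed in the text. With energy $p^2/2m + kq^2/2$ and $\omega = \sqrt{k/m}$ held fixed, the phase integral gives $e^{-\psi/\Theta} = 2\pi\Theta/\omega$ and an average energy equal to $\Theta$. The sketch below, with hypothetical function names, recovers the average index of probability from $\psi = \Theta\bar\eta + \bar\epsilon$ and checks by finite differences that $d\bar\epsilon = -\Theta\,d\bar\eta$ when the external parameter is not varied.

```python
import math

def psi(theta, omega=1.0):
    """psi for one harmonic oscillator: exp(-psi/theta) = 2*pi*theta/omega."""
    return -theta * math.log(2.0 * math.pi * theta / omega)

def mean_energy(theta):
    """Canonical average energy of the oscillator (two quadratic terms)."""
    return theta

def mean_index(theta, omega=1.0):
    """Average index of probability, solved from psi = theta*eta_bar + eps_bar."""
    return (psi(theta, omega) - mean_energy(theta)) / theta

def residual_483(theta, omega=1.0, h=1e-6):
    """Central-difference residual of d(eps_bar) + theta*d(eta_bar) at fixed omega."""
    d_eps = mean_energy(theta + h) - mean_energy(theta - h)
    d_eta = mean_index(theta + h, omega) - mean_index(theta - h, omega)
    return d_eps + theta * d_eta

print(residual_483(2.0))  # expected to be zero up to rounding error
```

The oscillator is chosen only because its phase integral is elementary; for the bodies of nature the same integral runs over an enormous number of variables.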


* See Chapter IV, pages 44, 46.

† See Chapter VII, pages 73-76.

We have also shown that when systems of different
ensembles are brought into conditions analogous to thermal contact,
the average result is a passage of energy from the ensemble
of the greater modulus to that of the less,* or in case of equal
moduli, that we have a condition of statistical equilibrium in
regard to the distribution of energy.†

Propositions have also been demonstrated analogous to
those in thermodynamics relating to a Carnot’s cycle,‡ or to
the tendency of entropy to increase,§ especially when bodies
of different temperature are brought into contact.‖

We have thus precisely defined quantities, and rigorously
demonstrated propositions, which hold for any number of
degrees of freedom, and which, when the number of degrees
of freedom ($n$) is enormously great, would appear to human
faculties as the quantities and propositions of empirical
thermodynamics.

It is evident, however, that there may be more than one
quantity defined for finite values of $n$, which approach the
same limit, when $n$ is increased indefinitely, and more than one
proposition relating to finite values of $n$, which approach the
same limiting form for $n = \infty$. There may be therefore,
and there are, other quantities which may be thought to have
some claim to be regarded as temperature and entropy with
respect to systems of a finite number of degrees of freedom.

The definitions and propositions which we have been
considering relate essentially to what we have called a canonical
ensemble of systems. This may appear a less natural and
simple conception than what we have called a microcanonical
ensemble of systems, in which all have the same energy, and
which in many cases represents simply the time-ensemble, or
ensemble of phases through which a single system passes in
the course of time.

* See Chapter XIII, page 160. † See Chapter IV, pages 35-37.

‡ See Chapter XIII, pages 162, 163. § See Chapter XII, pages 143-151.

‖ See Chapter XIII, page 159.

It may therefore seem desirable to find definitions and
propositions relating to these microcanonical ensembles, which
shall correspond to what in thermodynamics are based on
experience. Now the differential equation

$$d\epsilon = e^{-\phi} V\, d\log V - \overline{A_1}|_\epsilon\, da_1 - \overline{A_2}|_\epsilon\, da_2 - \text{etc.}, \tag{485}$$

which has been demonstrated in Chapter X, and which relates to
a microcanonical ensemble, $\overline{A_1}|_\epsilon$ denoting the average value of
$A_1$ in such an ensemble, corresponds precisely to the
thermodynamic equation, except for the sign of average applied to the
external forces. But as these forces are not entirely
determined by the energy with the external coordinates, the use of
average values is entirely germane to the subject, and affords
the readiest means of getting perfectly determined quantities.
These averages, which are taken for a microcanonical ensemble,
may seem from some points of view a more simple and natural
conception than those which relate to a canonical ensemble.
Moreover, the energy, and the quantity corresponding to
entropy, are free from the sign of average in this equation.

The quantity in the equation which corresponds to entropy
is $\log V$, the quantity $V$ being defined as the extension-in-phase
within which the energy is less than a certain limiting
value ($\epsilon$). This is certainly a more simple conception than the
average value in a canonical ensemble of the index of
probability of phase. Log $V$ has the property that when it is constant

$$d\epsilon = -\overline{A_1}|_\epsilon\, da_1 - \overline{A_2}|_\epsilon\, da_2 - \text{etc.}, \tag{486}$$

which closely corresponds to the thermodynamic property of
entropy, that when it is constant

$$d\epsilon = -A_1\,da_1 - A_2\,da_2 - \text{etc.} \tag{487}$$

The quantity in the equation which corresponds to
temperature is $e^{-\phi}V$, or $d\epsilon/d\log V$. In a canonical ensemble, the
average value of this quantity is equal to the modulus, as has
been shown by different methods in Chapters IX and X.

In Chapter X it has also been shown that if the systems
of a microcanonical ensemble consist of parts with separate
energies, the average value of $e^{-\phi}V$ for any part is equal to its
average value for any other part, and to the uniform value
of the same expression for the whole ensemble. This
corresponds to the theorem in the theory of heat that in case of
thermal equilibrium the temperatures of the parts of a body
are equal to one another and to that of the whole body.
Since the energies of the parts of a body cannot be supposed
to remain absolutely constant, even where this is the case
with respect to the whole body, it is evident that if we regard
the temperature as a function of the energy, the taking of
average or of probable values, or some other statistical process,
must be used with reference to the parts, in order to get a
perfectly definite value corresponding to the notion of
temperature.

It is worthy of notice in this connection that the average
value of the kinetic energy, either in a microcanonical
ensemble, or in a canonical, divided by one half the number of
degrees of freedom, is equal to $e^{-\phi}V$, or to its average value,
and that this is true not only of the whole system which is
distributed either microcanonically or canonically, but also
of any part, although the corresponding theorem relating to
temperature hardly belongs to empirical thermodynamics, since
neither the (inner) kinetic energy of a body, nor its number
of degrees of freedom is immediately cognizable to our
faculties, and we meet the gravest difficulties when we endeavor
to apply the theorem to the theory of gases, except in the
simplest case, that of the gases known as monatomic.

But the correspondence between $e^{-\phi}V$ or $d\epsilon/d\log V$ and
temperature is imperfect. If two isolated systems have such
energies that

$$\frac{d\epsilon_1}{d\log V_1} = \frac{d\epsilon_2}{d\log V_2},$$

and the two systems are regarded as combined to form a third
system with energy

$$\epsilon_{12} = \epsilon_1 + \epsilon_2,$$

we shall not have in general

$$\frac{d\epsilon_{12}}{d\log V_{12}} = \frac{d\epsilon_1}{d\log V_1} = \frac{d\epsilon_2}{d\log V_2},$$

as analogy with temperature would require. In fact, we have
seen that

$$\frac{d\epsilon_{12}}{d\log V_{12}} = \overline{\frac{d\epsilon_1}{d\log V_1}}\bigg|_{\epsilon_{12}} = \overline{\frac{d\epsilon_2}{d\log V_2}}\bigg|_{\epsilon_{12}},$$

where the second and third members of the equation denote
average values in an ensemble in which the compound system
is microcanonically distributed in phase. Let us suppose the
two original systems to be identical in nature. Then

$$\epsilon_1 = \epsilon_2 = \overline{\epsilon_1}|_{\epsilon_{12}} = \overline{\epsilon_2}|_{\epsilon_{12}}.$$

The equation in question would require that

$$\frac{d\epsilon_1}{d\log V_1} = \overline{\frac{d\epsilon_1}{d\log V_1}}\bigg|_{\epsilon_{12}},$$

i.e., that we get the same result, whether we take the value
of $d\epsilon_1/d\log V_1$ determined for the average value of $\epsilon_1$ in the
ensemble, or take the average value of $d\epsilon_1/d\log V_1$. This
will be the case where $d\epsilon_1/d\log V_1$ is a linear function of $\epsilon_1$.
Evidently this does not constitute the most general case.
Therefore the equation in question cannot be true in general.
It is true, however, in some very important particular cases, as
when the energy is a quadratic function of the p's and q's, or
of the p's alone.* When the equation holds, the case is
analogous to that of bodies in thermodynamics for which the
specific heat for constant volume is constant.

* This last case is important on account of its relation to the theory of
gases, although it must in strictness be regarded as a limit of possible cases,
rather than as a case which is itself possible.

Another quantity which is closely related to temperature is
$d\phi/d\epsilon$. It has been shown in Chapter IX that in a canonical
ensemble, if $n > 2$, the average value of $d\phi/d\epsilon$ is $1/\Theta$, and
that the most common value of the energy in the ensemble is
that for which $d\phi/d\epsilon = 1/\Theta$. The first of these properties
may be compared with that of $d\epsilon/d\log V$, which has been
seen to have the average value $\Theta$ in a canonical ensemble,
without restriction in regard to the number of degrees of
freedom.

With respect to microcanonical ensembles also, $d\phi/d\epsilon$ has
a property similar to what has been mentioned with respect to
$d\epsilon/d\log V$. That is, if a system microcanonically distributed
in phase consists of two parts with separate energies, and each
with more than two degrees of freedom, the average values in
the ensemble of $d\phi/d\epsilon$ for the two parts are equal to one
another and to the value of the same expression for the whole.
In our usual notations

$$\overline{\frac{d\phi_1}{d\epsilon_1}}\bigg|_{\epsilon_{12}} = \overline{\frac{d\phi_2}{d\epsilon_2}}\bigg|_{\epsilon_{12}} = \frac{d\phi_{12}}{d\epsilon_{12}}$$

if $n_1 > 2$, and $n_2 > 2$.

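This analogy with temperature has the same incompleteness
which was noticed with respect to $d\epsilon/d\log V$, viz., if two
systems have such energies ($\epsilon_1$ and $\epsilon_2$) that

$$\frac{d\phi_1}{d\epsilon_1} = \frac{d\phi_2}{d\epsilon_2},$$

and they are combined to form a third system with energy

$$\epsilon_{12} = \epsilon_1 + \epsilon_2,$$

we shall not have in general

$$\frac{d\phi_{12}}{d\epsilon_{12}} = \frac{d\phi_1}{d\epsilon_1} = \frac{d\phi_2}{d\epsilon_2}.$$

Thus, if the energy is a quadratic function of the p's and q's,
we have *

$$\frac{d\phi_1}{d\epsilon_1} = \frac{n_1 - 1}{\epsilon_1}, \qquad \frac{d\phi_2}{d\epsilon_2} = \frac{n_2 - 1}{\epsilon_2},$$

$$\frac{d\phi_{12}}{d\epsilon_{12}} = \frac{n_{12} - 1}{\epsilon_{12}} = \frac{n_1 + n_2 - 1}{\epsilon_1 + \epsilon_2},$$

where $n_1$, $n_2$, $n_{12}$ are the numbers of degrees of freedom of the
separate and combined systems. But

$$\frac{d\phi_1}{d\epsilon_1} = \frac{d\phi_2}{d\epsilon_2} = \frac{n_1 + n_2 - 2}{\epsilon_1 + \epsilon_2}.$$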

If the energy is a quadratic function of the p's alone, the case
would be the same except that we should have $\tfrac12 n_1$, $\tfrac12 n_2$, $\tfrac12 n_{12}$
instead of $n_1$, $n_2$, $n_{12}$. In these particular cases, the analogy
between $d\epsilon/d\log V$ and temperature would be complete, as has
already been remarked. We should have

$$\frac{d\epsilon_1}{d\log V_1} = \frac{\epsilon_1}{n_1}, \qquad \frac{d\epsilon_2}{d\log V_2} = \frac{\epsilon_2}{n_2},$$

$$\frac{d\epsilon_{12}}{d\log V_{12}} = \frac{\epsilon_{12}}{n_{12}} = \frac{d\epsilon_1}{d\log V_1} = \frac{d\epsilon_2}{d\log V_2},$$

when the energy is a quadratic function of the p's and q's, and
similar equations with $\tfrac12 n_1$, $\tfrac12 n_2$, $\tfrac12 n_{12}$, instead of $n_1$, $n_2$, $n_{12}$,
when the energy is a quadratic function of the p's alone.

* See foot-note on page 93. We have here made the least value of the
energy consistent with the values of the external coordinates zero instead
of $\epsilon_a$, as is evidently allowable when the external coordinates are supposed
invariable.
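
The contrast between the two candidate temperatures in the quadratic case can be checked with exact arithmetic. The sketch below uses only the closed forms quoted above, $d\epsilon/d\log V = \epsilon/n$ and $d\phi/d\epsilon = (n-1)/\epsilon$; the function names and the particular numbers are hypothetical choices made for illustration.

```python
from fractions import Fraction

def temp_logV(eps, n):
    """d(eps)/d(log V) = eps/n for energy quadratic in the p's and q's."""
    return Fraction(eps) / n

def dphi_deps(eps, n):
    """d(phi)/d(eps) = (n - 1)/eps for the same quadratic case."""
    return Fraction(n - 1) / Fraction(eps)

n1, n2 = 3, 5

# Two systems matched by d(eps)/d(log V): eps1/n1 == eps2/n2.
eps1, eps2 = Fraction(6), Fraction(10)
combined_logV = temp_logV(eps1 + eps2, n1 + n2)

# Two systems matched by d(phi)/d(eps): (n1-1)/e1 == (n2-1)/e2.
e1, e2 = Fraction(4), Fraction(8)
combined_phi = dphi_deps(e1 + e2, n1 + n2)

print(temp_logV(eps1, n1), temp_logV(eps2, n2), combined_logV)
print(dphi_deps(e1, n1), dphi_deps(e2, n2), combined_phi)
```

In the first line all three values agree, so the analogy is complete; in the second the combined system gives $(n_1 + n_2 - 1)/(\epsilon_1 + \epsilon_2)$, which differs from the common value $(n_1 + n_2 - 2)/(\epsilon_1 + \epsilon_2)$ of the parts.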

More characteristic of $d\phi/d\epsilon$ are its properties relating to
most probable values of energy. If a system having two parts
with separate energies and each with more than two degrees
of freedom is microcanonically distributed in phase, the most
probable division of energy between the parts, in a system
taken at random from the ensemble, satisfies the equation

$$\frac{d\phi_1}{d\epsilon_1} = \frac{d\phi_2}{d\epsilon_2}, \tag{488}$$

which corresponds to the thermodynamic theorem that the
distribution of energy between the parts of a system, in case of
thermal equilibrium, is such that the temperatures of the parts
are equal.

To prove the theorem, we observe that the fractional part
of the whole number of systems which have the energy of one
part ($\epsilon_1$) between the limits $\epsilon_1'$ and $\epsilon_1''$ is expressed by

$$e^{-\phi_{12}} \int_{\epsilon_1'}^{\epsilon_1''} e^{\phi_1 + \phi_2}\, d\epsilon_1,$$

where the variables are connected by the equation

$$\epsilon_1 + \epsilon_2 = \text{constant} = \epsilon_{12}.$$

The greatest value of this expression, for a constant
infinitesimal value of the difference $\epsilon_1'' - \epsilon_1'$, determines a value of $\epsilon_1$,
which we may call its most probable value. This depends on
the greatest possible value of $\phi_1 + \phi_2$. Now if $n_1 > 2$, and
$n_2 > 2$, we shall have $\phi_1 = -\infty$ for the least possible value of
$\epsilon_1$, and $\phi_2 = -\infty$ for the least possible value of $\epsilon_2$. Between
these limits $\phi_1$ and $\phi_2$ will be finite and continuous. Hence
$\phi_1 + \phi_2$ will have a maximum satisfying the equation (488).

But if $n_1 \leq 2$, or $n_2 \leq 2$, $d\phi_1/d\epsilon_1$ or $d\phi_2/d\epsilon_2$ may be
negative, or zero, for all values of $\epsilon_1$ or $\epsilon_2$, and can hardly be
regarded as having properties analogous to temperature.
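
The maximum of $\phi_1 + \phi_2$ can be located numerically in a simple case. The sketch below assumes, purely for illustration, energies that are quadratic functions of the p's and q's, so that $\phi_i = (n_i - 1)\log\epsilon_i$ up to a constant; the function names and the grid search are hypothetical conveniences. It compares a brute-force maximum over $\epsilon_1$ with the closed form that equation (488) gives under this assumption.

```python
import math

def most_probable_eps1(n1, n2, eps12):
    """Closed form from (n1-1)/eps1 = (n2-1)/eps2 with eps1 + eps2 = eps12."""
    return eps12 * (n1 - 1) / (n1 + n2 - 2)

def grid_argmax(n1, n2, eps12, steps=20000):
    """Brute-force maximum of phi1 + phi2 over a uniform grid of eps1 values."""
    best_eps, best_val = None, -math.inf
    for k in range(1, steps):
        eps1 = eps12 * k / steps
        # phi1 + phi2 up to an additive constant (quadratic-energy assumption)
        val = (n1 - 1) * math.log(eps1) + (n2 - 1) * math.log(eps12 - eps1)
        if val > best_val:
            best_eps, best_val = eps1, val
    return best_eps

print(most_probable_eps1(4, 7, 9.0), grid_argmax(4, 7, 9.0))
```

Both conditions $n_1 > 2$ and $n_2 > 2$ hold in this example, so the sum tends to $-\infty$ at both ends of the range and the interior maximum exists.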

It is also worthy of notice that if a system which is
microcanonically distributed in phase has three parts with separate
energies, and each with more than two degrees of freedom, the
most probable division of energy between these parts satisfies
the equation

$$\frac{d\phi_1}{d\epsilon_1} = \frac{d\phi_2}{d\epsilon_2} = \frac{d\phi_3}{d\epsilon_3}.$$

That is, this equation gives the most probable set of values
of $\epsilon_1$, $\epsilon_2$, and $\epsilon_3$. But it does not give the most probable
value of $\epsilon_1$, or of $\epsilon_2$, or of $\epsilon_3$. Thus, if the energies are
quadratic functions of the p's and q's, the most probable division
of energy is given by the equation

$$\frac{n_1 - 1}{\epsilon_1} = \frac{n_2 - 1}{\epsilon_2} = \frac{n_3 - 1}{\epsilon_3}.$$

But the most probable value of $\epsilon_1$ is given by

$$\frac{n_1 - 1}{\epsilon_1} = \frac{n_2 + n_3 - 1}{\epsilon_2 + \epsilon_3},$$

while the preceding equations give

$$\frac{n_1 - 1}{\epsilon_1} = \frac{n_2 + n_3 - 2}{\epsilon_2 + \epsilon_3}.$$

These distinctions vanish for very great values of $n_1$, $n_2$, $n_3$.
For small values of these numbers, they are important. Such
facts seem to indicate that the consideration of the most
probable division of energy among the parts of a system does
not afford a convenient foundation for the study of
thermodynamic analogies in the case of systems of a small number of
degrees of freedom. The fact that a certain division of energy
is the most probable has really no especial physical importance,
except when the ensemble of possible divisions are grouped so
closely together that the most probable division may fairly
represent the whole. This is in general the case, to a very
close approximation, when $n$ is enormously great; it entirely
fails when $n$ is small.
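
The difference between the most probable set of energies and the most probable value of a single energy can be made concrete. The sketch below solves the two conditions stated above for $\epsilon_1$, with the total energy fixed, in the quadratic case; the function names are hypothetical and the numbers are chosen only for illustration.

```python
from fractions import Fraction

def joint_mode_eps1(n1, n2, n3, total):
    """eps1 in the most probable *set* (eps1, eps2, eps3):
    each eps_i is proportional to n_i - 1."""
    return Fraction(total) * (n1 - 1) / (n1 + n2 + n3 - 3)

def marginal_mode_eps1(n1, n2, n3, total):
    """Most probable value of eps1 taken alone, from
    (n1 - 1)/eps1 = (n2 + n3 - 1)/(total - eps1)."""
    return Fraction(total) * (n1 - 1) / (n1 + n2 + n3 - 2)

for n in (3, 30, 3000):
    joint = joint_mode_eps1(n, n, n, 1)
    marginal = marginal_mode_eps1(n, n, n, 1)
    print(n, joint, marginal, float(joint - marginal))
```

With three degrees of freedom in each part the two answers are 1/3 and 2/7 of the total energy; the gap shrinks roughly in proportion to the reciprocal of the number of degrees of freedom.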

If we regard $d\phi/d\epsilon$ as corresponding to the reciprocal of
temperature, or, in other words, $d\epsilon/d\phi$ as corresponding to
temperature, $\phi$ will correspond to entropy. It has been defined
as $\log(dV/d\epsilon)$. In the considerations on which its definition
is founded, it is therefore very similar to $\log V$. We have
seen that $d\phi/d\log V$ approaches the value unity when $n$ is
very great.*

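To form a differential equation on the model of the
thermodynamic equation (482), in which $d\epsilon/d\phi$ shall take the place
of temperature, and $\phi$ of entropy, we may write

$$d\epsilon = \left(\frac{d\epsilon}{d\phi}\right)_a d\phi + \left(\frac{d\epsilon}{da_1}\right)_{\phi,a} da_1 + \left(\frac{d\epsilon}{da_2}\right)_{\phi,a} da_2 + \text{etc.}, \tag{489}$$

or

$$d\phi = \left(\frac{d\phi}{d\epsilon}\right)_a d\epsilon + \left(\frac{d\phi}{da_1}\right)_{\epsilon,a} da_1 + \left(\frac{d\phi}{da_2}\right)_{\epsilon,a} da_2 + \text{etc.} \tag{490}$$

With respect to the differential coefficients in the last
equation, which corresponds exactly to (482) solved with respect
to $d\eta$, we have seen that their average values in a canonical
ensemble are equal to $1/\Theta$, and the averages of $A_1/\Theta$, $A_2/\Theta$,
etc.† We have also seen that $d\epsilon/d\phi$ (or $d\phi/d\epsilon$) has relations
to the most probable values of energy in parts of a
microcanonical ensemble. That $(d\epsilon/da_1)_{\phi,a}$, etc., have properties
somewhat analogous, may be shown as follows.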

* See Chapter X, pages 120, 121.

† See Chapter IX, equations (321), (327).

In a physical experiment, we measure a force by balancing it
against another. If we should ask what force applied to
increase or diminish $a_1$ would balance the action of the systems,
it would be one which varies with the different systems. But
we may ask what single force will make a given value of $a_1$
the most probable, and we shall find that under certain
conditions $(d\epsilon/da_1)_{\phi,a}$ represents that force.

To make the problem definite, let us consider a system
consisting of the original system together with another having
the coordinates $a_1$, $a_2$, etc., and forces $A_1'$, $A_2'$, etc., tending
to increase those coordinates. These are in addition to the
forces $A_1$, $A_2$, etc., exerted by the original system, and are
derived from a force-function ($-\epsilon_a'$) by the equations

$$A_1' = -\frac{d\epsilon_a'}{da_1}, \qquad A_2' = -\frac{d\epsilon_a'}{da_2}, \qquad \text{etc.}$$
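For the energy of the whole system we may write

$$E = \epsilon + \epsilon_a' + \tfrac12 m_1 \dot a_1^2 + \tfrac12 m_2 \dot a_2^2 + \text{etc.},$$

and for the extension-in-phase of the whole system within any
limits

$$\int \cdots \int dp_1 \ldots dq_n\, da_1\, m_1\, d\dot a_1\, da_2\, m_2\, d\dot a_2 \ldots,$$

or

$$\int \cdots \int e^{\phi}\, d\epsilon\, da_1\, m_1\, d\dot a_1\, da_2\, m_2\, d\dot a_2 \ldots,$$

or again

$$\int \cdots \int e^{\phi}\, dE\, da_1\, m_1\, d\dot a_1\, da_2\, m_2\, d\dot a_2 \ldots,$$

since $d\epsilon = dE$, when $a_1$, $\dot a_1$, $a_2$, $\dot a_2$, etc., are constant. If the
limits are expressed by $E$ and $E + dE$, $a_1$ and $a_1 + da_1$, $\dot a_1$ and
$\dot a_1 + d\dot a_1$, etc., the integral reduces to

$$e^{\phi}\, dE\, da_1\, m_1\, d\dot a_1\, da_2\, m_2\, d\dot a_2 \ldots$$

The values of $a_1$, $\dot a_1$, $a_2$, $\dot a_2$, etc., which make this expression
a maximum for constant values of the energy of the whole
system and of the differentials $dE$, $da_1$, $d\dot a_1$, etc., are what may
be called the most probable values of $a_1$, $\dot a_1$, etc., in an
ensemble in which the whole system is distributed microcanonically.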
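To determine these values we have

$$d e^{\phi} = 0,$$

when

$$d\left(\epsilon + \epsilon_a' + \tfrac12 m_1 \dot a_1^2 + \tfrac12 m_2 \dot a_2^2 + \text{etc.}\right) = 0.$$

That is,

$$d\phi = 0,$$

when

$$\left(\frac{d\epsilon}{d\phi}\right)_a d\phi + \left[\left(\frac{d\epsilon}{da_1}\right)_{\phi,a} - A_1'\right] da_1 + \text{etc.} + m_1 \dot a_1\, d\dot a_1 + \text{etc.} = 0.$$

This requires $\dot a_1 = 0$, $\dot a_2 = 0$, etc., and

$$\left(\frac{d\epsilon}{da_1}\right)_{\phi,a} = A_1', \qquad \left(\frac{d\epsilon}{da_2}\right)_{\phi,a} = A_2', \qquad \text{etc.}$$

This shows that for any given values of $E$, $a_1$, $a_2$, etc.,
$(d\epsilon/da_1)_{\phi,a}$, $(d\epsilon/da_2)_{\phi,a}$, etc., represent the forces (in the
generalized sense) which the external bodies would have to exert
to make these values of $a_1$, $a_2$, etc., the most probable under
the conditions specified. When the differences of the external
forces which are exerted by the different systems are
negligible, $-(d\epsilon/da_1)_{\phi,a}$, etc., represent these forces.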

It is certainly in the quantities relating to a canonical
ensemble, $\bar\epsilon$, $\Theta$, $\bar\eta$, $\bar A_1$, etc., $a_1$, etc., that we find the most
complete correspondence with the quantities of the
thermodynamic equation (482). Yet the conception itself of the
canonical ensemble may seem to some artificial, and hardly germane
to a natural exposition of the subject; and the quantities
$\epsilon$, $d\epsilon/d\log V$, $\log V$, $\overline{A_1}|_\epsilon$, etc., $a_1$, etc., or $\epsilon$, $d\epsilon/d\phi$, $\phi$,
$(d\epsilon/da_1)_{\phi,a}$, etc., $a_1$, etc., which are closely related to ensembles of constant
energy, and to average and most probable values in such
ensembles, and most of which are defined without reference
to any ensemble, may appear the most natural analogues of
the thermodynamic quantities.

In regard to the naturalness of seeking analogies with the
thermodynamic behavior of bodies in canonical or
microcanonical ensembles of systems, much will depend upon how we
approach the subject, especially upon the question whether we
regard energy or temperature as an independent variable.

It is very natural to take energy for an independent variable
rather than temperature, because ordinary mechanics furnishes
us with a perfectly defined conception of energy, whereas the
idea of something relating to a mechanical system and
corresponding to temperature is a notion but vaguely defined. Now
if the state of a system is given by its energy and the external
coordinates, it is incompletely defined, although its partial
definition is perfectly clear as far as it goes. The ensemble of
phases microcanonically distributed, with the given values of
the energy and the external coordinates, will represent the
imperfectly defined system better than any other ensemble or
single phase. When we approach the subject from this side,
our theorems will naturally relate to average values, or most
probable values, in such ensembles.

In this case, the choice between the variables of (485) or of
(489) will be determined partly by the relative importance
which is attached to average and probable values. It would
seem that in general average values are the most important, and
that they lend themselves better to analytical transformations.
This consideration would give the preference to the system of
variables in which $\log V$ is the analogue of entropy. Moreover,
if we make $\phi$ the analogue of entropy, we are embarrassed by
the necessity of making numerous exceptions for systems of
one or two degrees of freedom.

On the other hand, the definition of $\phi$ may be regarded as a
little more simple than that of $\log V$, and if our choice is
determined by the simplicity of the definitions of the analogues of
entropy and temperature, it would seem that the $\phi$ system
should have the preference. In our definition of these
quantities, $V$ was defined first, and $e^{\phi}$ derived from $V$ by
differentiation. This gives the relation of the quantities in the most
simple analytical form. Yet so far as the notions are
concerned, it is perhaps more natural to regard $V$ as derived from
$e^{\phi}$ by integration. At all events, $e^{\phi}$ may be defined
independently of $V$, and its definition may be regarded as more
simple as not requiring the determination of the zero from
which $V$ is measured, which sometimes involves questions
of a delicate nature. In fact, the quantity $e^{\phi}$ may exist,
when the definition of $V$ becomes illusory for practical
purposes, as the integral by which it is determined becomes infinite.

The case is entirely different, when we regard the
temperature as an independent variable, and we have to consider a
system which is described as having a certain temperature and
certain values for the external coordinates. Here also the
state of the system is not completely defined, and will be
better represented by an ensemble of phases than by any single
phase. What is the nature of such an ensemble as will best
represent the imperfectly defined state?

When we wish to give a body a certain temperature, we
place it in a bath of the proper temperature, and when we
regard what we call thermal equilibrium as established, we say
that the body has the same temperature as the bath.
Perhaps we place a second body of standard character, which we
call a thermometer, in the bath, and say that the first body,
the bath, and the thermometer, have all the same temperature.

But the body under such circumstances, as well as the
bath, and the thermometer, even if they were entirely isolated
from external influences (which it is convenient to suppose
in a theoretical discussion), would be continually changing in
phase, and in energy as well as in other respects, although
our means of observation are not fine enough to perceive
these variations.

The series of phases through which the whole system runs
in the course of time may not be entirely determined by the
energy, but may depend on the initial phase in other respects.
In such cases the ensemble obtained by the microcanonical
distribution of the whole system, which includes all possible
time-ensembles combined in the proportion which seems least
arbitrary, will represent better than any one time-ensemble
the effect of the bath. Indeed a single time-ensemble, when
it is not also a microcanonical ensemble, is too ill-defined a
notion to serve the purposes of a general discussion. We
will therefore direct our attention, when we suppose the body
placed in a bath, to the microcanonical ensemble of phases
thus obtained.

If we now suppose the quantity of the substance forming
the bath to be increased, the anomalies of the separate
energies of the body and of the thermometer in the microcanonical
ensemble will be increased, but not without limit. The
anomalies of the energy of the bath, considered in comparison with
its whole energy, diminish indefinitely as the quantity of the
bath is increased, and become in a sense negligible, when
the quantity of the bath is sufficiently increased. The
ensemble of phases of the body, and of the thermometer,
approach a standard form as the quantity of the bath is
indefinitely increased. This limiting form is easily shown to be
what we have described as the canonical distribution.

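Let us write $\epsilon$ for the energy of the whole system consisting
of the body first mentioned, the bath, and the thermometer
(if any), and let us first suppose this system to be distributed
canonically with the modulus $\Theta$. We have by (205)

$$\overline{(\epsilon - \bar\epsilon)^2} = \Theta^2 \frac{d\bar\epsilon}{d\Theta},$$

and since

$$\bar\epsilon_p = \frac{n}{2}\Theta,$$

$$\frac{d\bar\epsilon}{d\Theta} = \frac{n}{2}\frac{d\bar\epsilon}{d\bar\epsilon_p}.$$

If we write $\Delta\epsilon$ for the anomaly of mean square, we have

$$(\Delta\epsilon)^2 = \overline{(\epsilon - \bar\epsilon)^2}.$$

If we set

$$\Delta\Theta = \frac{d\Theta}{d\bar\epsilon}\,\Delta\epsilon,$$

$\Delta\Theta$ will represent approximately the increase of $\Theta$ which
would produce an increase in the average value of the energy
equal to its anomaly of mean square. Now these equations
give

$$(\Delta\Theta)^2 = \frac{2\Theta^2}{n}\frac{d\bar\epsilon_p}{d\bar\epsilon},$$

which shows that we may diminish $\Delta\Theta$ indefinitely by increasing
the quantity of the bath.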
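
The way in which $\Delta\Theta$ falls off with the number of degrees of
freedom may be illustrated numerically. The following is only a
minimal sketch and not part of the original argument: the function
name `relative_modulus_anomaly` is hypothetical, and the default
`dep_de = 1.0` assumes that the energy is entirely kinetic, so that
$d\bar\epsilon_p/d\bar\epsilon = 1$.

```python
import math


def relative_modulus_anomaly(n, dep_de=1.0):
    """Return Delta(Theta)/Theta = sqrt((2/n) * d(eps_p)/d(eps)).

    n      -- number of degrees of freedom of the whole system
    dep_de -- assumed value of d(eps_p)/d(eps); 1.0 corresponds to an
              energy that is entirely kinetic (illustrative assumption)
    """
    return math.sqrt(2.0 * dep_de / n)


if __name__ == "__main__":
    # The ratio shrinks as the inverse square root of n.
    for n in (10**2, 10**6, 10**24):
        print(n, relative_modulus_anomaly(n))
```

Under these assumptions the ratio $\Delta\Theta/\Theta$ diminishes as the
inverse square root of $n$, which is the sense in which it vanishes
for an indefinitely large bath.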

Now our canonical ensemble consists of an infinity of microcanonical
ensembles, which differ only in consequence of the
different values of the energy which is constant in each. If
we consider separately the phases of the first body which
occur in the canonical ensemble of the whole system, these
phases will form a canonical ensemble of the same modulus.
This canonical ensemble of phases of the first body will consist
of parts which belong to the different microcanonical
ensembles into which the canonical ensemble of the whole
system is divided.

Let us now imagine that the modulus of the principal canonical
ensemble is increased by $2\Delta\Theta$, and its average energy
by $2\Delta\epsilon$. The modulus of the canonical ensemble of the
phases of the first body considered separately will be increased
by $2\Delta\Theta$. We may regard the infinity of microcanonical ensembles
into which we have divided the principal canonical
ensemble as each having its energy increased by $2\Delta\epsilon$. Let
us see how the ensembles of phases of the first body contained
in these microcanonical ensembles are affected. We
may assume that they will all be affected in about the same
way, as all the differences which come into account may be
treated as small. Therefore, the canonical ensemble formed by
taking them together will also be affected in the same way.
But we know how this is affected. It is by the increase of
its modulus by $2\Delta\Theta$, a quantity which vanishes when the
quantity of the bath is indefinitely increased.

In the case of an infinite bath, therefore, the increase of the
energy of one of the microcanonical ensembles by $2\Delta\epsilon$ produces
a vanishing effect on the distribution in energy of the
phases of the first body which it contains. But $2\Delta\epsilon$ is more
than the average difference of energy between the microcanonical
ensembles. The distribution in energy of these
phases is therefore the same in the different microcanonical
ensembles, and must therefore be canonical, like that of the
ensemble which they form when taken together.*

* In order to appreciate the above reasoning, it should be understood that
the differences of energy which occur in the canonical ensemble of phases of
the first body are not here regarded as vanishing quantities. To fix one's
ideas, one may imagine that he has the fineness of perception to make these
differences seem large. The difference between the part of these phases
which belong to one microcanonical ensemble of the whole system and the
part which belongs to another would still be imperceptible, when the quantity
of the bath is sufficiently increased.

As a general theorem, the conclusion may be expressed in
the words: If a system of a great number of degrees of
freedom is microcanonically distributed in phase, any very
small part of it may be regarded as canonically distributed.*
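
The theorem may be illustrated in the simplest case. The sketch
below is an illustration under stated assumptions, not the general
proof: it supposes the energy of the whole system to be entirely
kinetic, with $n$ degrees of freedom and total energy $E = n\Theta/2$,
so that the weight which the remainder of the system gives to a
part of $k$ degrees of freedom having energy $e$ is proportional to
$(1 - e/E)^{(n-k)/2 - 1}$. The function names are hypothetical.

```python
import math


def bath_factor(e, theta, n, k):
    """Microcanonical weight (1 - e/E)**((n - k)/2 - 1), with E = n*theta/2.

    Assumes purely kinetic energy, so that the extension-in-phase of the
    remaining n - k degrees of freedom grows as a power of their energy.
    """
    total_energy = 0.5 * n * theta
    exponent = 0.5 * (n - k) - 1.0
    # log1p keeps precision when e/E is very small.
    return math.exp(exponent * math.log1p(-e / total_energy))


def canonical_factor(e, theta):
    """Canonical weight exp(-e/theta) for the same part."""
    return math.exp(-e / theta)


if __name__ == "__main__":
    for n in (10, 10**3, 10**6):
        print(n, bath_factor(1.0, 1.0, n, 3), canonical_factor(1.0, 1.0))
```

As $n$ is increased with $\Theta$ held fixed, the two factors approach
one another, which is the content of the theorem for this special
case.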

It would seem, therefore, that a canonical ensemble of 
phases is what best represents, with the precision necessary 
for exact mathematical reasoning, the notion of a body with 
a given temperature, if we conceive of the temperature as the 
state produced by such processes as we actually use in physics 
to produce a given temperature. Since the anomalies of the 
body increase with the quantity of the bath, we can only get 
rid of all that is arbitrary in the ensemble of phases which is 
to represent the notion of a body of a given temperature by 
making the bath infinite, which brings us to the canonical 
distribution. 

A comparison of temperature and entropy with their analogues
in statistical mechanics would be incomplete without a
consideration of their differences with respect to units and
zeros, and the numbers used for their numerical specification.
If we apply the notions of statistical mechanics to such bodies
as we usually consider in thermodynamics, for which the
kinetic energy is of the same order of magnitude as the unit
of energy, but the number of degrees of freedom is enormous,
the values of $\Theta$, $d\epsilon/d\log V$, and $d\epsilon/d\phi$ will be of the same
order of magnitude as $1/n$, and the variable part of $\bar\eta$, $\log V$,
and $\phi$ will be of the same order of magnitude as $n$.† If these
quantities, therefore, represent in any sense the notions of temperature
and entropy, they will nevertheless not be measured
in units of the usual order of magnitude, a fact which must
be borne in mind in determining what magnitudes may be
regarded as insensible to human observation.

Now nothing prevents our supposing energy and time in 
our statistical formulae to be measured in such units as may 

* It is assumed (and without this assumption the theorem would have
no distinct meaning) that the part of the ensemble considered may be
regarded as having separate energy.

† See equations (124), (288), (289), and (314); also page 106.

be convenient for physical purposes. But when these units
have been chosen, the numerical values of $\Theta$, $d\epsilon/d\log V$,
$d\epsilon/d\phi$, $\bar\eta$, $\log V$, $\phi$, are entirely determined,* and in order to
compare them with temperature and entropy, the numerical
values of which depend upon an arbitrary unit, we must multiply
all values of $\Theta$, $d\epsilon/d\log V$, $d\epsilon/d\phi$ by a constant ($K$),
and divide all values of $\bar\eta$, $\log V$, and $\phi$ by the same constant.
This constant is the same for all bodies, and depends only on
the units of temperature and energy which we employ. For
ordinary units it is of the same order of magnitude as the
numbers of atoms in ordinary bodies.

We are not able to determine the numerical value of $K$,
as it depends on the number of molecules in the bodies with
which we experiment. To fix our ideas, however, we may
seek an expression for this value, based upon very probable
assumptions, which will show how we would naturally proceed
to its evaluation, if our powers of observation were fine
enough to take cognizance of individual molecules.

If the unit of mass of a monatomic gas contains $\nu$ atoms,
and it may be treated as a system of $3\nu$ degrees of freedom,
which seems to be the case, we have for canonical
distribution

$$\bar\epsilon_p = \tfrac{3}{2}\nu\Theta,$$

$$\frac{d\bar\epsilon_p}{d\Theta} = \tfrac{3}{2}\nu. \tag{491}$$

If we write $T$ for temperature, and $c_v$ for the specific heat of
the gas for constant volume (or rather the limit toward
which this specific heat tends, as rarefaction is indefinitely
increased), we have

$$\frac{d\epsilon_p}{dT} = c_v, \tag{492}$$

since we may regard the energy as entirely kinetic. We may
set the $\epsilon_p$ of this equation equal to the $\bar\epsilon_p$ of the preceding,

* The unit of time only affects the last three quantities, and these only
by an additive constant, which disappears (with the additive constant of
entropy), when differences of entropy are compared with their statistical
analogues. See page 19.

where indeed the individual values of which the average is
taken would appear to human observation as identical. This
gives

$$\frac{d\Theta}{dT} = \frac{2c_v}{3\nu},$$

whence

$$\frac{1}{K} = \frac{2c_v}{3\nu}, \tag{493}$$

a value recognized by physicists as a constant independent of
the kind of monatomic gas considered.

We may also express the value of $K$ in a somewhat different
form, which corresponds to the indirect method by which
physicists are accustomed to determine the quantity $c_v$. The
kinetic energy due to the motions of the centers of mass of
the molecules of a mass of gas sufficiently expanded is easily
shown to be equal to

$$\tfrac{3}{2}pv,$$

where $p$ and $v$ denote the pressure and volume. The average
value of the same energy in a canonical ensemble of such
a mass of gas is

$$\tfrac{3}{2}\Theta\nu,$$

where $\nu$ denotes the number of molecules in the gas. Equating
these values, we have

$$pv = \Theta\nu, \tag{494}$$

whence

$$\frac{1}{K} = \frac{\Theta}{T} = \frac{pv}{\nu T}. \tag{495}$$

Now the laws of Boyle, Charles, and Avogadro may be expressed
by the equation

$$pv = A\nu T, \tag{496}$$

where $A$ is a constant depending only on the units in which
energy and temperature are measured. $1/K$, therefore, might
be called the constant of the law of Boyle, Charles, and
Avogadro as expressed with reference to the true number of
molecules in a gaseous body.
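
The text treats the number of molecules as unknown. Purely as an
illustration, equation (495) may be evaluated with present-day SI
values, which are an assumption foreign to the original: one mole
of an ideal gas at 273.15 K and 101325 Pa, with the Avogadro
number taken at its SI defining value. The function name
`inverse_K` is hypothetical.

```python
def inverse_K(pressure, volume, molecules, temperature):
    """Equation (495): 1/K = p*v / (nu*T)."""
    return pressure * volume / (molecules * temperature)


# Assumed modern values (not from the original text).
AVOGADRO = 6.02214076e23        # molecules per mole, SI defining value
MOLAR_VOLUME = 22.41396954e-3   # m^3 per mole of ideal gas at 273.15 K, 101325 Pa

value = inverse_K(101325.0, MOLAR_VOLUME, AVOGADRO, 273.15)
print(value)  # joules per kelvin
```

With these units the result agrees with the quantity now called the
Boltzmann constant, which is what $1/K$ represents when energy is
measured in joules and temperature in kelvins.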

Since such numbers are unknown to us, it is more convenient
to express the law with reference to relative values. If
we denote by $M$ the so-called molecular weight of a gas, that
is, a number taken from a table of numbers proportional to
the weights of various molecules and atoms, but having one
of the values, perhaps the atomic weight of hydrogen, arbitrarily
made unity, the law of Boyle, Charles, and Avogadro
may be written in the more practical form

$$pv = A'\frac{m}{M}T, \tag{497}$$

where $A'$ is a constant and $m$ the weight of gas considered.
It is evident that $1/K$ is equal to the product of the constant
of the law in this form and the (true) weight of an atom of
hydrogen, or such other atom or molecule as may be given
the value unity in the table of molecular weights.

In the following chapter we shall consider the necessary 
modifications in the theory of equilibrium, when the quantity 
of matter contained in a system is to be regarded as variable, 
or, if the system contains more than one kind of matter, 
when the quantities of the several kinds of matter in the 
system are to be regarded as independently variable. This 
will give us yet another set of variables in the statistical 
equation, corresponding to those of the amplified form of 
the thermodynamic equation. 



CHAPTER XV. 


SYSTEMS COMPOSED OF MOLECULES. 

The nature of material bodies is such that especial interest 
attaches to the dynamics of systems composed of a great 
number of entirely similar particles, or, it may be, of a great 
number of particles of several kinds, all of each kind being 
entirely similar to each other. We shall therefore proceed to 
consider systems composed of such particles, whether in great 
numbers or otherwise, and especially to consider the statistical
equilibrium of ensembles of such systems. One of the variations
to be considered in regard to such systems is a variation
in the numbers of the particles of the various kinds which it
contains, and the question of statistical equilibrium between 
two ensembles of such systems relates in part to the tendencies 
of the various kinds of particles to pass from the one to the 
other. 

First of all, we must define precisely what is meant by
statistical equilibrium of such an ensemble of systems. The
essence of statistical equilibrium is the permanence of the
number of systems which fall within any given limits with
respect to phase. We have therefore to define how the term
“phase” is to be understood in such cases. If two phases differ
only in that certain entirely similar particles have changed
places with one another, are they to be regarded as identical
or different phases? If the particles are regarded as indistinguishable,
it seems in accordance with the spirit of the
statistical method to regard the phases as identical. In fact,
it might be urged that in such an ensemble of systems as we
are considering no identity is possible between the particles
of different systems except that of qualities, and if $\nu$ particles
of one system are described as entirely similar to one another
and to $\nu$ of another system, nothing remains on which to base
the identification of any particular particle of the first system
with any particular particle of the second. And this would
be true, if the ensemble of systems had a simultaneous
objective existence. But it hardly applies to the creations
of the imagination. In the cases which we have been considering,
and in those which we shall consider, it is not only
possible to conceive of the motion of an ensemble of similar
systems simply as possible cases of the motion of a single
system, but it is actually in large measure for the sake of
representing more clearly the possible cases of the motion of
a single system that we use the conception of an ensemble
of systems. The perfect similarity of several particles of a
system will not in the least interfere with the identification
of a particular particle in one case with a particular particle
in another. The question is one to be decided in accordance
with the requirements of practical convenience in the discussion
of the problems with which we are engaged.

Our present purpose will often require us to use the terms
phase, density-in-phase, statistical equilibrium, and other connected
terms on the supposition that phases are not altered
by the exchange of places between similar particles. Some
of the most important questions with which we are concerned
have reference to phases thus defined. We shall call them
phases determined by generic definitions, or briefly, generic
phases. But we shall also be obliged to discuss phases defined
by the narrower definition (so that exchange of position
between similar particles is regarded as changing the phase), 
which will be called phases determined by specific definitions, 
or briefly, specific phases. For the analytical description of 
a specific phase is more simple than that of a generic phase. 
And it is a more simple matter to make a multiple integral 
extend over all possible specific phases than to make one extend 
without repetition over all possible generic phases. 

It is evident that if $\nu_1, \nu_2, \dots \nu_h$ are the numbers of the different
kinds of molecules in any system, the number of specific
phases embraced in one generic phase is represented by the
continued product $\nu_1!\,\nu_2! \dots \nu_h!$, and the coefficient of probability
of a generic phase is the sum of the probability-coefficients
of the specific phases which it represents. When these are
equal among themselves, the probability-coefficient of the generic
phase is equal to that of the specific phase multiplied by
$\nu_1!\,\nu_2! \dots \nu_h!$. It is also evident that statistical equilibrium
may subsist with respect to generic phases without statistical
equilibrium with respect to specific phases, but not vice versa.
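
The counting just described may be sketched as follows. This is a
minimal illustration only; the function names are hypothetical, and
the second function covers only the case, mentioned above, in which
the specific phases are equally probable among themselves.

```python
from math import factorial, prod


def specific_phases_per_generic(particle_numbers):
    """Continued product nu_1! nu_2! ... nu_h! for the given particle numbers."""
    return prod(factorial(nu) for nu in particle_numbers)


def generic_coefficient(specific_coefficient, particle_numbers):
    """Probability-coefficient of a generic phase when all the specific
    phases which it embraces have the same probability-coefficient."""
    return specific_coefficient * specific_phases_per_generic(particle_numbers)


if __name__ == "__main__":
    # Two particles of one kind and three of another.
    print(specific_phases_per_generic([2, 3]))
```

For two particles of one kind and three of another, the product is
$2!\,3!$, so that one generic phase embraces twelve specific phases.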

Similar questions arise where one particle is capable of
several equivalent positions. Does the change from one of
these positions to another change the phase? It would be
most natural and logical to make it affect the specific phase,
but not the generic. The number of specific phases contained
in a generic phase would then be $\nu_1!\,\kappa_1^{\nu_1} \dots \nu_h!\,\kappa_h^{\nu_h}$, where
$\kappa_1, \dots \kappa_h$ denote the numbers of equivalent positions belonging
to the several kinds of particles. The case in which a $\kappa$ is
infinite would then require especial attention. It does not
appear that the resulting complications in the formulae would
be compensated by any real advantage. The reason of this is
that in problems of real interest equivalent positions of a
particle will always be equally probable. In this respect,
equivalent positions of the same particle are entirely unlike
the $\nu!$ different ways in which $\nu$ particles may be distributed
in $\nu$ different positions. Let it therefore be understood that
in spite of the physical equivalence of different positions of
the same particle they are to be considered as constituting a
difference of generic phase as well as of specific. The number
of specific phases contained in a generic phase is therefore
always given by the product $\nu_1!\,\nu_2! \dots \nu_h!$.

Instead of considering, as in the preceding chapters, ensembles
of systems differing only in phase, we shall now
suppose that the systems constituting an ensemble are composed
of particles of various kinds, and that they differ not
only in phase but also in the numbers of these particles which
they contain. The external coordinates of all the systems in
the ensemble are supposed, as heretofore, to have the same
value, and when they vary, to vary together. For distinction,
we may call such an ensemble a grand ensemble, and one in
which the systems differ only in phase a petit ensemble. A
grand ensemble is therefore composed of a multitude of petit
ensembles. The ensembles which we have hitherto discussed
are petit ensembles.

Let $\nu_1, \dots \nu_h$, etc., denote the numbers of the different
kinds of particles in a system, $\epsilon$ its energy, and $q_1, \dots q_n$,
$p_1, \dots p_n$ its coordinates and momenta. If the particles are of
the nature of material points, the number of coordinates ($n$)
of the system will be equal to $3\nu_1 \dots + 3\nu_h$. But if the particles
are less simple in their nature, if they are to be treated
as rigid solids, the orientation of which must be regarded, or
if they consist each of several atoms, so as to have more than
three degrees of freedom, the number of coordinates of the
system will be equal to the sum of $\nu_1$, $\nu_2$, etc., multiplied
each by the number of degrees of freedom of the kind of
particle to which it relates.
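
The rule for the number of coordinates may be sketched as below.
The function name `coordinate_count` and the sample numbers are
hypothetical and serve only as an illustration.

```python
def coordinate_count(particle_numbers, degrees_of_freedom):
    """Number of coordinates n: the sum of nu_i multiplied by the number
    of degrees of freedom of a particle of the i-th kind."""
    if len(particle_numbers) != len(degrees_of_freedom):
        raise ValueError("one number of degrees of freedom is needed per kind")
    return sum(nu * f for nu, f in zip(particle_numbers, degrees_of_freedom))


if __name__ == "__main__":
    # Material points of two kinds: n = 3*5 + 3*2.
    print(coordinate_count([5, 2], [3, 3]))
    # Second kind treated as rigid solids with six degrees of freedom.
    print(coordinate_count([5, 2], [3, 6]))
```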

Let us consider an ensemble in which the number of
systems having $\nu_1, \dots \nu_h$ particles of the several kinds, and
having values of their coordinates and momenta lying between
the limits $q_1$ and $q_1 + dq_1$, $p_1$ and $p_1 + dp_1$, etc., is represented
by the expression

$$\frac{N e^{\frac{\Omega + \mu_1\nu_1 \dots + \mu_h\nu_h - \epsilon}{\Theta}}}{\nu_1! \dots \nu_h!}\, dp_1 \dots dq_n, \tag{498}$$

where $N$, $\Omega$, $\Theta$, $\mu_1, \dots \mu_h$ are constants, $N$ denoting the total
number of systems in the ensemble. The expression

$$\frac{N e^{\frac{\Omega + \mu_1\nu_1 \dots + \mu_h\nu_h - \epsilon}{\Theta}}}{\nu_1! \dots \nu_h!} \tag{499}$$

evidently represents the density-in-phase of the ensemble
within the limits described, that is, for a phase specifically
defined. The expression

$$\frac{e^{\frac{\Omega + \mu_1\nu_1 \dots + \mu_h\nu_h - \epsilon}{\Theta}}}{\nu_1! \dots \nu_h!} \tag{500}$$

is therefore the probability-coefficient for a phase specifically
defined. This has evidently the same value for all the
$\nu_1! \dots \nu_h!$ phases obtained by interchanging the phases of
particles of the same kind. The probability-coefficient for a
generic phase will be $\nu_1! \dots \nu_h!$ times as great, viz.,

$$e^{\frac{\Omega + \mu_1\nu_1 \dots + \mu_h\nu_h - \epsilon}{\Theta}}. \tag{501}$$

We shall say that such an ensemble as has been described
is canonically distributed, and shall call the constant $\Theta$ its
modulus. It is evidently what we have called a grand ensemble.
The petit ensembles of which it is composed are
canonically distributed, according to the definitions of Chapter
IV, since the expression

$$\frac{e^{\frac{\Omega + \mu_1\nu_1 \dots + \mu_h\nu_h}{\Theta}}}{\nu_1! \dots \nu_h!} \tag{502}$$

is constant for each petit ensemble. The grand ensemble,
therefore, is in statistical equilibrium with respect to specific
phases.

If an ensemble, whether grand or petit, is identical so far 
as generic phases are concerned with one canonically distributed,
we shall say that its distribution is canonical with
respect to generic phases. Such an ensemble is evidently in 
statistical equilibrium with respect to generic phases, although 
it may not be so with respect to specific phases. 

If we write $H$ for the index of probability of a generic phase
in a grand ensemble, we have for the case of canonical
distribution

$$H = \frac{\Omega + \mu_1\nu_1 \dots + \mu_h\nu_h - \epsilon}{\Theta}. \tag{503}$$

It will be observed that the $H$ is a linear function of $\epsilon$ and
$\nu_1, \dots \nu_h$; also that whenever the index of probability of
generic phases in a grand ensemble is a linear function of
$\epsilon, \nu_1, \dots \nu_h$, the ensemble is canonically distributed with
respect to generic phases.

The constant $\Omega$ we may regard as determined by the
equation

$$N = \sum_{\nu_1} \dots \sum_{\nu_h} \int \dots \int_{\text{all phases}} \frac{N e^{\frac{\Omega + \mu_1\nu_1 \dots + \mu_h\nu_h - \epsilon}{\Theta}}}{\nu_1! \dots \nu_h!}\, dp_1 \dots dq_n, \tag{504}$$

or

$$e^{-\frac{\Omega}{\Theta}} = \sum_{\nu_1} \dots \sum_{\nu_h} \frac{e^{\frac{\mu_1\nu_1 \dots + \mu_h\nu_h}{\Theta}}}{\nu_1! \dots \nu_h!} \int \dots \int_{\text{all phases}} e^{-\frac{\epsilon}{\Theta}}\, dp_1 \dots dq_n, \tag{505}$$

where the multiple sum indicated by $\sum_{\nu_1} \dots \sum_{\nu_h}$ includes all
terms obtained by giving to each of the symbols $\nu_1 \dots \nu_h$ all
integral values from zero upward, and the multiple integral
(which is to be evaluated separately for each term of the
multiple sum) is to be extended over all the (specific) phases
of the system having the specified numbers of particles of the
various kinds. The multiple integral in the last equation is
what we have represented by $e^{-\frac{\psi}{\Theta}}$. See equation (92). We
may therefore write

$$e^{-\frac{\Omega}{\Theta}} = \sum_{\nu_1} \dots \sum_{\nu_h} \frac{e^{\frac{\mu_1\nu_1 \dots + \mu_h\nu_h - \psi}{\Theta}}}{\nu_1! \dots \nu_h!}. \tag{506}$$
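
The series (506) may be evaluated numerically in a simple case. The
sketch below is an illustration under an assumption not made in the
text: a single kind of particle, with no mutual action, so that
$e^{-\psi/\Theta} = z^{\nu}$, where $z$ (a hypothetical parameter) stands for
the phase-integral of one particle. The series then has the closed
form $\exp(z\,e^{\mu/\Theta})$, against which the truncated sum can be
compared.

```python
import math


def grand_sum(mu, theta, z, max_particles=200):
    """Truncated series (506) for one kind of particle, assuming
    exp(-psi/theta) = z**nu (particles without mutual action)."""
    x = z * math.exp(mu / theta)
    total = 0.0
    term = 1.0  # nu = 0: the system consisting of no particles
    for nu in range(max_particles + 1):
        total += term
        term *= x / (nu + 1)  # next term x**(nu+1) / (nu+1)!
    return total


def omega(mu, theta, z):
    """Omega from exp(-Omega/theta) = grand_sum."""
    return -theta * math.log(grand_sum(mu, theta, z))


if __name__ == "__main__":
    print(grand_sum(0.5, 1.0, 2.0), math.exp(2.0 * math.exp(0.5)))
    print(omega(0.5, 1.0, 2.0))
```

Under this assumption $-\Omega/\Theta = z\,e^{\mu/\Theta}$, a finite quantity for
all finite values of $\mu$ and positive $\Theta$.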


It should be observed that the summation includes a term
in which all the symbols $\nu_1 \dots \nu_h$ have the value zero. We
must therefore recognize in a certain sense a system consisting
of no particles, which, although a barren subject of study in
itself, cannot well be excluded as a particular case of a system
of a variable number of particles. In this case $\epsilon$ is constant,
and there are no integrations to be performed. We have
therefore*

$$e^{-\frac{\psi}{\Theta}} = e^{-\frac{\epsilon}{\Theta}}, \quad \text{i.e.,} \quad \psi = \epsilon.$$

* This conclusion may appear a little strained. The original definition
of $\psi$ may not be regarded as fairly applying to systems of no degrees of
freedom. We may therefore prefer to regard these equations as defining
$\psi$ in this case.

The value of $\epsilon_p$ is of course zero in this case. But the
value of $\epsilon_q$ contains an arbitrary constant, which is generally
determined by considerations of convenience, so that $\epsilon_q$ and $\epsilon$
do not necessarily vanish with $\nu_1, \dots \nu_h$.

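Unless $-\Omega$ has a finite value, our formulae become illusory.
We have already, in considering petit ensembles canonically
distributed, found it necessary to exclude cases in which $-\psi$
has not a finite value.* The same exclusion would here
make $-\psi$ finite for any finite values of $\nu_1 \dots \nu_h$. This does
not necessarily make a multiple series of the form (506) finite.
We may observe, however, that if for all values of $\nu_1 \dots \nu_h$

$$-\psi \le c_0 + c_1\nu_1 \dots + c_h\nu_h, \tag{507}$$

where $c_0, c_1, \dots c_h$ are constants or functions of $\Theta$,

$$e^{-\frac{\Omega}{\Theta}} \le \sum_{\nu_1} \dots \sum_{\nu_h} \frac{e^{\frac{c_0 + (\mu_1 + c_1)\nu_1 \dots + (\mu_h + c_h)\nu_h}{\Theta}}}{\nu_1! \dots \nu_h!},$$

i.e.,

$$e^{-\frac{\Omega}{\Theta}} \le e^{\frac{c_0}{\Theta}} \sum_{\nu_1} \frac{e^{\frac{(\mu_1 + c_1)\nu_1}{\Theta}}}{\nu_1!} \dots \sum_{\nu_h} \frac{e^{\frac{(\mu_h + c_h)\nu_h}{\Theta}}}{\nu_h!},$$

i.e.,

$$e^{-\frac{\Omega}{\Theta}} \le e^{\frac{c_0}{\Theta}}\, e^{e^{\frac{\mu_1 + c_1}{\Theta}}} \dots e^{e^{\frac{\mu_h + c_h}{\Theta}}},$$

i.e.,

$$-\frac{\Omega}{\Theta} \le \frac{c_0}{\Theta} + e^{\frac{\mu_1 + c_1}{\Theta}} \dots + e^{\frac{\mu_h + c_h}{\Theta}}. \tag{508}$$

The value of $-\Omega$ will therefore be finite, when the condition
(507) is satisfied. If therefore we assume that $-\Omega$ is finite,
we do not appear to exclude any cases which are analogous to
those of nature.†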
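
The bound (508) may be checked numerically for one kind of particle.
The following is a sketch only: the function names are hypothetical,
and the form of $\psi$ used in the example is an arbitrary choice made
to satisfy the condition (507), not a quantity taken from the text.

```python
import math


def log_grand_sum(psi, mu, theta, max_particles=200):
    """Logarithm of the series (506) for one kind of particle, i.e.
    -Omega/theta, for a supplied function psi(nu). Truncated sum."""
    total = 0.0
    for nu in range(max_particles + 1):
        # lgamma(nu + 1) = log(nu!), which avoids overflow for large nu.
        total += math.exp((mu * nu - psi(nu)) / theta - math.lgamma(nu + 1))
    return math.log(total)


def bound_508(c0, c1, mu, theta):
    """Right-hand member of (508) for one kind of particle."""
    return c0 / theta + math.exp((mu + c1) / theta)


if __name__ == "__main__":
    c0, c1, mu, theta = 1.0, 0.5, 0.2, 1.0
    # Illustrative psi with -psi = c0 + c1*nu - 0.05*nu**2 <= c0 + c1*nu.
    psi = lambda nu: -(c0 + c1 * nu) + 0.05 * nu**2
    print(log_grand_sum(psi, mu, theta), bound_508(c0, c1, mu, theta))
```

When $-\psi$ is exactly equal to $c_0 + c_1\nu$ the two members coincide,
since the series then reduces to the exponential series.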

The interest of the ensemble which has been described lies
in the fact that it may be in statistical equilibrium, both in

* See Chapter IV, page 36.

† If the external coordinates determine a certain volume within which the
system is confined, the contrary of (507) would imply that we could obtain
an infinite amount of work by crowding an infinite quantity of matter into a
finite volume.

respect to exchange of energy and exchange of particles, with
other grand ensembles canonically distributed and having the
same values of $\Theta$ and of the coefficients $\mu_1$, $\mu_2$, etc., when the
circumstances are such that exchange of energy and of
particles are possible, and when equilibrium would not subsist,
were it not for equal values of these constants in the two
ensembles.

With respect to the exchange of energy, the case is exactly 
the same as that of the petit ensembles considered in Chapter 
IV, and needs no especial discussion. The question of exchange
of particles is to a certain extent analogous, and may
be treated in a somewhat similar manner. Let us suppose
that we have two grand ensembles canonically distributed
with respect to specific phases, with the same value of the
modulus and of the coefficients $\mu_1 \dots \mu_h$, and let us consider
the ensemble of all the systems obtained by combining each 
system of the first ensemble with each of the second. 

The probability-coefficient of a generic phase in the first
ensemble may be expressed by

$$e^{\frac{\Omega' + \mu_1\nu_1' \dots + \mu_h\nu_h' - \epsilon'}{\Theta}}. \tag{509}$$

The probability-coefficient of a specific phase will then be
expressed by

$$\frac{e^{\frac{\Omega' + \mu_1\nu_1' \dots + \mu_h\nu_h' - \epsilon'}{\Theta}}}{\nu_1'! \dots \nu_h'!}, \tag{510}$$

since each generic phase comprises $\nu_1'! \dots \nu_h'!$ specific phases.
In the second ensemble the probability-coefficients of the
generic and specific phases will be

$$e^{\frac{\Omega'' + \mu_1\nu_1'' \dots + \mu_h\nu_h'' - \epsilon''}{\Theta}}, \tag{511}$$

and

$$\frac{e^{\frac{\Omega'' + \mu_1\nu_1'' \dots + \mu_h\nu_h'' - \epsilon''}{\Theta}}}{\nu_1''! \dots \nu_h''!}. \tag{512}$$

The probability-coefficient of a generic phase in the third
ensemble, which consists of systems obtained by regarding
each system of the first ensemble combined with each of the
second as forming a system, will be the product of the probability-coefficients
of the generic phases of the systems combined,
and will therefore be represented by the formula

$$e^{\frac{\Omega''' + \mu_1\nu_1''' \dots + \mu_h\nu_h''' - \epsilon'''}{\Theta}}, \tag{513}$$

where $\Omega''' = \Omega' + \Omega''$, $\epsilon''' = \epsilon' + \epsilon''$, $\nu_1''' = \nu_1' + \nu_1''$, etc. It
will be observed that $\nu_1'''$, etc., represent the numbers of
particles of the various kinds in the third ensemble, and $\epsilon'''$
its energy; also that $\Omega'''$ is a constant. The third ensemble
is therefore canonically distributed with respect to generic
phases.

If all the systems in the same generic phase in the third
ensemble were equably distributed among the $\nu_1'''! \dots \nu_h'''!$ specific
phases which are comprised in the generic phase, the probability-coefficient
of a specific phase would be

$$\frac{e^{\frac{\Omega''' + \mu_1\nu_1''' \dots + \mu_h\nu_h''' - \epsilon'''}{\Theta}}}{\nu_1'''! \dots \nu_h'''!}. \tag{514}$$

In fact, however, the probability-coefficient of any specific
phase which occurs in the third ensemble is

$$\frac{e^{\frac{\Omega''' + \mu_1\nu_1''' \dots + \mu_h\nu_h''' - \epsilon'''}{\Theta}}}{\nu_1'! \dots \nu_h'!\;\nu_1''! \dots \nu_h''!}, \tag{515}$$

which we get by multiplying the probability-coefficients of
specific phases in the first and second ensembles. The difference
between the formulae (514) and (515) is due to the fact
that the generic phases to which (513) relates include not
only the specific phases occurring in the third ensemble and
having the probability-coefficient (515), but also all the
specific phases obtained from these by interchange of similar
particles between two combined systems. Of these the probability-coefficient
is evidently zero, as they do not occur in the
ensemble.
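
The ratio of (515) to (514) is the product, over the kinds of
particles, of $\nu_i'''!/(\nu_i'!\,\nu_i''!)$, that is, the number of ways in which
the particles of each kind may be apportioned between the two
combined systems. A minimal sketch of this count follows; the
function name `exchange_multiplicity` is hypothetical.

```python
from math import comb, prod


def exchange_multiplicity(first, second):
    """Ratio of (515) to (514): the product over kinds of
    (nu' + nu'')! / (nu'! nu''!), written as binomial coefficients."""
    if len(first) != len(second):
        raise ValueError("both systems must list the same kinds of particles")
    return prod(comb(a + b, a) for a, b in zip(first, second))


if __name__ == "__main__":
    # One kind: two particles in the first system, one in the second.
    print(exchange_multiplicity([2], [1]))
```

Of the $\nu_1'''! \dots \nu_h'''!$ specific phases embraced in a generic phase of
the third ensemble, only $\nu_1'! \dots \nu_h'!\;\nu_1''! \dots \nu_h''!$ occur, and these,
each with the coefficient (515), together make up the coefficient
(513) of the generic phase.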

Now this third ensemble is in statistical equilibrium, with
respect both to specific and generic phases, since the ensembles
from which it is formed are so. This statistical equilibrium
is not dependent on the equality of the modulus and the coefficients
$\mu_1, \dots \mu_h$ in the first and second ensembles. It depends
only on the fact that the two original ensembles were separately
in statistical equilibrium, and that there is no interaction
between them, the combining of the two ensembles to form a
third being purely nominal, and involving no physical connection.
This independence of the systems, determined physically
by forces which prevent particles from passing from one system
to the other, or coming within range of each other's action,
is represented mathematically by infinite values of the energy
for particles in a space dividing the systems. Such a space
may be called a diaphragm.

If we now suppose that, when we combine the systems of
the two original ensembles, the forces are so modified that the
energy is no longer infinite for particles in all the space forming
the diaphragm, but is diminished in a part of this space,
so that it is possible for particles to pass from one system
to the other, this will involve a change in the function $\epsilon'''$
which represents the energy of the combined systems, and the
equation $\epsilon''' = \epsilon' + \epsilon''$ will no longer hold. Now if the coefficient
of probability in the third ensemble were represented
by (513) with this new function $\epsilon'''$, we should have statistical
equilibrium, with respect to generic phases, although not to
specific. But this need involve only a trifling change in the
distribution of the third ensemble,* a change represented by
the addition of comparatively few systems in which the transference
of particles is taking place to the immense number

* It will be observed that, so far as the distribution is concerned, very
large and infinite values of $\epsilon$ (for certain phases) amount to nearly the same
thing, one representing the total and the other the nearly total exclusion
of the phases in question. An infinite change, therefore, in the value of $\epsilon$
(for certain phases) may represent a vanishing change in the distribution.

obtained by combining the two original ensembles. The
difference between the ensemble which would be in statistical
equilibrium, and that obtained by combining the two original
ensembles may be diminished without limit, while it is still
possible for particles to pass from one system to another. In
this sense we may say that the ensemble formed by combining
the two given ensembles may still be regarded as in a state of
(approximate) statistical equilibrium with respect to generic
phases, when it has been made possible for particles to pass
between the systems combined, and when statistical equilibrium
for specific phases has therefore entirely ceased to exist, and
when the equilibrium for generic phases would also have
entirely ceased to exist, if the given ensembles had not been
canonically distributed, with respect to generic phases, with
the same values of $\Theta$ and $\mu_1, \dots \mu_h$.

It is evident also that considerations of this kind will apply
separately to the several kinds of particles. We may diminish
the energy in the space forming the diaphragm for one kind of
particle and not for another. This is the mathematical expression
for a “semipermeable” diaphragm. The condition
necessary for statistical equilibrium where the diaphragm is
permeable only to particles to which the suffix $(\ )_1$ relates
will be fulfilled when $\mu_1$ and $\Theta$ have the same values in the
two ensembles, although the other coefficients $\mu_2$, etc., may be
different.

This important property of grand ensembles with canonical 
distribution will supply the motive for a more particular examination
of the nature of such ensembles, and especially of
the comparative numbers of systems in the several petit ensembles
which make up a grand ensemble, and of the average
values in the grand ensemble of some of the most important 
quantities, and of the average squares of the deviations from 
these average values. 

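The probability that a system taken at random from a
grand ensemble canonically distributed will have exactly
$\nu_1, \dots \nu_h$ particles of the various kinds is expressed by the
multiple integral

$$\int \dots \int_{\text{all phases}} \frac{e^{\frac{\Omega + \mu_1\nu_1 \dots + \mu_h\nu_h - \epsilon}{\Theta}}}{\nu_1! \dots \nu_h!}\, dp_1 \dots dq_n, \tag{516}$$

or

$$\frac{e^{\frac{\Omega + \mu_1\nu_1 \dots + \mu_h\nu_h - \psi}{\Theta}}}{\nu_1! \dots \nu_h!}. \tag{517}$$

This may be called the probability of the petit ensemble
$(\nu_1, \dots \nu_h)$. The sum of all such probabilities is evidently
unity. That is,

$$\sum_{\nu_1} \dots \sum_{\nu_h} \frac{e^{\frac{\Omega + \mu_1\nu_1 \dots + \mu_h\nu_h - \psi}{\Theta}}}{\nu_1! \dots \nu_h!} = 1, \tag{518}$$

which agrees with (506).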
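
The probabilities (517) may be tabulated in a simple case. The
sketch below rests on an assumption not made in the text: a single
kind of particle without mutual action, so that $e^{-\psi/\Theta} = z^{\nu}$
and therefore $-\Omega/\Theta = z\,e^{\mu/\Theta}$. The parameter $z$ and the
function name are hypothetical.

```python
import math


def petit_probability(nu, mu, theta, z):
    """Probability (517) of the petit ensemble of nu particles, assuming
    exp(-psi/theta) = z**nu, so that Omega/theta = -z*exp(mu/theta)."""
    x = z * math.exp(mu / theta)
    # exp(Omega/theta) * x**nu / nu!, computed through logarithms.
    return math.exp(-x + nu * math.log(x) - math.lgamma(nu + 1))


if __name__ == "__main__":
    probabilities = [petit_probability(nu, 0.5, 1.0, 2.0) for nu in range(200)]
    print(sum(probabilities))  # very nearly unity, as (518) requires
```

Under this assumption the numbers of particles are distributed
according to what is commonly called the law of Poisson, with the
mean number $z\,e^{\mu/\Theta}$.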

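The average value in the grand ensemble of any quantity
$u$, is given by the formula

$$\bar u = \sum_{\nu_1} \cdots \sum_{\nu_h} \int \cdots \int_{\text{phases}}^{\text{all}} \frac{u\, e^{\frac{\Omega + \mu_1\nu_1 + \cdots + \mu_h\nu_h - \epsilon}{\Theta}}}{\nu_1! \cdots \nu_h!}\, dp_1 \ldots dq_n. \tag{519}$$

If $u$ is a function of $\nu_1, \ldots \nu_h$ alone, i.e., if it has the same
value in all systems of any same petit ensemble, the formula
reduces to

$$\bar u = \sum_{\nu_1} \cdots \sum_{\nu_h} \frac{u\, e^{\frac{\Omega + \mu_1\nu_1 + \cdots + \mu_h\nu_h - \psi}{\Theta}}}{\nu_1! \cdots \nu_h!}. \tag{520}$$

Again, if we write $\bar u|_{\text{grand}}$ and $\bar u|_{\text{petit}}$ to distinguish averages in
the grand and petit ensembles, we shall have

$$\bar u|_{\text{grand}} = \sum_{\nu_1} \cdots \sum_{\nu_h} \bar u|_{\text{petit}}\, \frac{e^{\frac{\Omega + \mu_1\nu_1 + \cdots + \mu_h\nu_h - \psi}{\Theta}}}{\nu_1! \cdots \nu_h!}. \tag{521}$$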


In this chapter, in which we are treating of grand ensembles,
$\bar u$ will always denote the average for a grand ensemble.
In the preceding chapters, $\bar u$ has always denoted
the average for a petit ensemble.

Equation (505), which we repeat in a slightly different
form, viz.,

$$e^{-\frac{\Omega}{\Theta}} = \sum_{\nu_1} \cdots \sum_{\nu_h} \int \cdots \int_{\text{phases}}^{\text{all}} \frac{e^{\frac{\mu_1\nu_1 + \cdots + \mu_h\nu_h - \epsilon}{\Theta}}}{\nu_1! \cdots \nu_h!}\, dp_1 \ldots dq_n, \tag{522}$$

shows that $\Omega$ is a function of $\Theta$ and $\mu_1, \ldots \mu_h$; also of the
external coordinates $a_1$, $a_2$, etc., which are involved implicitly
in $\epsilon$. If we differentiate the equation regarding all these
quantities as variable, we have




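$$\begin{aligned}
e^{-\frac{\Omega}{\Theta}} \left( -\frac{d\Omega}{\Theta} + \frac{\Omega}{\Theta^2}\, d\Theta \right)
= {} & -\frac{d\Theta}{\Theta^2} \sum_{\nu_1} \cdots \sum_{\nu_h} \int \cdots \int_{\text{phases}}^{\text{all}} \frac{(\mu_1\nu_1 + \cdots + \mu_h\nu_h - \epsilon)\, e^{\frac{\mu_1\nu_1 + \cdots + \mu_h\nu_h - \epsilon}{\Theta}}}{\nu_1! \cdots \nu_h!}\, dp_1 \ldots dq_n \\
& + \frac{d\mu_1}{\Theta} \sum_{\nu_1} \cdots \sum_{\nu_h} \int \cdots \int_{\text{phases}}^{\text{all}} \frac{\nu_1\, e^{\frac{\mu_1\nu_1 + \cdots + \mu_h\nu_h - \epsilon}{\Theta}}}{\nu_1! \cdots \nu_h!}\, dp_1 \ldots dq_n + \text{etc.} \\
& - \frac{da_1}{\Theta} \sum_{\nu_1} \cdots \sum_{\nu_h} \int \cdots \int_{\text{phases}}^{\text{all}} \frac{\frac{d\epsilon}{da_1}\, e^{\frac{\mu_1\nu_1 + \cdots + \mu_h\nu_h - \epsilon}{\Theta}}}{\nu_1! \cdots \nu_h!}\, dp_1 \ldots dq_n - \text{etc.}
\end{aligned} \tag{523}$$

If we multiply this equation by $e^{\Omega/\Theta}$, and set as usual $A_1$, $A_2$,
etc., for $-d\epsilon/da_1$, $-d\epsilon/da_2$, etc., we get in virtue of the law
expressed by equation (519),

$$-\frac{d\Omega}{\Theta} + \frac{\Omega}{\Theta^2}\, d\Theta = -\frac{d\Theta}{\Theta^2} (\mu_1\bar\nu_1 + \cdots + \mu_h\bar\nu_h - \bar\epsilon) + \frac{d\mu_1}{\Theta}\bar\nu_1 + \frac{d\mu_2}{\Theta}\bar\nu_2 + \text{etc.} + \frac{da_1}{\Theta}\bar A_1 + \frac{da_2}{\Theta}\bar A_2 + \text{etc.}; \tag{524}$$

that is,

$$d\Omega = \frac{\Omega + \mu_1\bar\nu_1 + \cdots + \mu_h\bar\nu_h - \bar\epsilon}{\Theta}\, d\Theta - \sum \bar\nu_1\, d\mu_1 - \sum \bar A_1\, da_1. \tag{525}$$

Since equation (503) gives

$$\frac{\Omega + \mu_1\bar\nu_1 + \cdots + \mu_h\bar\nu_h - \bar\epsilon}{\Theta} = \bar{\mathrm H}, \tag{526}$$

the preceding equation may be written

$$d\Omega = \bar{\mathrm H}\, d\Theta - \sum \bar\nu_1\, d\mu_1 - \sum \bar A_1\, da_1. \tag{527}$$

Again, equation (526) gives

$$d\Omega + \sum \mu_1\, d\bar\nu_1 + \sum \bar\nu_1\, d\mu_1 - d\bar\epsilon = \Theta\, d\bar{\mathrm H} + \bar{\mathrm H}\, d\Theta. \tag{528}$$

Eliminating $d\Omega$ from these equations, we get

$$d\bar\epsilon = -\Theta\, d\bar{\mathrm H} + \sum \mu_1\, d\bar\nu_1 - \sum \bar A_1\, da_1. \tag{529}$$

If we set

$$\Psi = \bar\epsilon + \Theta \bar{\mathrm H}, \tag{530}$$

$$d\Psi = d\bar\epsilon + \Theta\, d\bar{\mathrm H} + \bar{\mathrm H}\, d\Theta, \tag{531}$$

we have

$$d\Psi = \bar{\mathrm H}\, d\Theta + \sum \mu_1\, d\bar\nu_1 - \sum \bar A_1\, da_1. \tag{532}$$

The corresponding thermodynamic equations are

$$d\epsilon = T\, d\eta + \sum \mu_1\, dm_1 - \sum A_1\, da_1, \tag{533}$$

$$\psi = \epsilon - T\eta, \tag{534}$$

$$d\psi = -\eta\, dT + \sum \mu_1\, dm_1 - \sum A_1\, da_1. \tag{535}$$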
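
Equations (526) and (527) may be checked numerically in a simple case. The sketch below assumes (this is not in the text) a single kind of free particle in a fixed volume, for which $\Omega = -\Theta z e^{\mu/\Theta}$ with $z = V (2\pi m \Theta)^{3/2}$, $\bar\nu = -\Omega/\Theta$ and $\bar\epsilon = \tfrac32 \bar\nu \Theta$; the function names are hypothetical. The external coordinates are held constant, so that only the $d\Theta$ and $d\mu_1$ terms of (527) are tested.

```python
import math

def omega(theta, mu, volume=1.0, mass=1.0):
    """Omega for the assumed ideal gas: exp(-Omega/theta) = exp(z * e^{mu/theta})."""
    z = volume * (2.0 * math.pi * mass * theta) ** 1.5   # one-particle phase integral
    return -theta * z * math.exp(mu / theta)

def check_527(theta=1.3, mu=-2.0, step=1e-5):
    # Partial derivatives of Omega by central differences.
    d_omega_d_theta = (omega(theta + step, mu) - omega(theta - step, mu)) / (2 * step)
    d_omega_d_mu = (omega(theta, mu + step) - omega(theta, mu - step)) / (2 * step)
    nu_bar = -omega(theta, mu) / theta        # average number in this special case
    eps_bar = 1.5 * nu_bar * theta            # average energy (kinetic only)
    h_bar = (omega(theta, mu) + mu * nu_bar - eps_bar) / theta   # equation (526)
    return d_omega_d_theta, h_bar, d_omega_d_mu, -nu_bar

d_theta, h_bar, d_mu, minus_nu_bar = check_527()
```

If (527) holds, `d_theta` should agree with `h_bar`, and `d_mu` with `minus_nu_bar`, to within the error of the finite differences.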


These are derived from the thermodynamic equations (114)
and (117) by the addition of the terms necessary to take
account of variation in the quantities ($m_1$, $m_2$, etc.) of the
several substances of which a body is composed. The
correspondence of the equations is most perfect when the
component substances are measured in such units that $m_1$, $m_2$,
etc., are proportional to the numbers of the different kinds
of molecules or atoms. The quantities $\mu_1$, $\mu_2$, etc., in these
thermodynamic equations may be defined as differential
coefficients by either of the equations in which they occur.*


* Compare Transactions Connecticut Academy, Vol. III, pages 116 ff.

If we compare the statistical equations (529) and (532)
with (114) and (112), which are given in Chapter IV, and
discussed in Chapter XIV, as analogues of thermodynamic
equations, we find considerable difference. Beside the
terms corresponding to the additional terms in the
thermodynamic equations of this chapter, and beside the fact that
the averages are taken in a grand ensemble in one case
and in a petit in the other, the analogues of entropy, $\bar{\mathrm H}$
and $\bar\eta$, are quite different in definition and value. We shall
return to this point after we have determined the order
of magnitude of the usual anomalies of $\nu_1, \ldots \nu_h$.

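If we differentiate equation (518) with respect to $\mu_1$, and
multiply by $\Theta$, we get

$$\sum_{\nu_1} \cdots \sum_{\nu_h} \left( \frac{d\Omega}{d\mu_1} + \nu_1 \right) \frac{e^{\frac{\Omega + \mu_1\nu_1 + \cdots + \mu_h\nu_h - \psi}{\Theta}}}{\nu_1! \cdots \nu_h!} = 0, \tag{536}$$

whence $d\Omega/d\mu_1 = -\bar\nu_1$, which agrees with (527).
Differentiating again with respect to $\mu_1$, and to $\mu_2$, and setting

$$\frac{d\Omega}{d\mu_1} = -\bar\nu_1, \qquad \frac{d\Omega}{d\mu_2} = -\bar\nu_2,$$

we get

$$\sum_{\nu_1} \cdots \sum_{\nu_h} \left( \frac{d^2\Omega}{d\mu_1^2} + \frac{(\nu_1 - \bar\nu_1)^2}{\Theta} \right) \frac{e^{\frac{\Omega + \mu_1\nu_1 + \cdots + \mu_h\nu_h - \psi}{\Theta}}}{\nu_1! \cdots \nu_h!} = 0, \tag{537}$$

$$\sum_{\nu_1} \cdots \sum_{\nu_h} \left( \frac{d^2\Omega}{d\mu_1\, d\mu_2} + \frac{(\nu_1 - \bar\nu_1)(\nu_2 - \bar\nu_2)}{\Theta} \right) \frac{e^{\frac{\Omega + \mu_1\nu_1 + \cdots + \mu_h\nu_h - \psi}{\Theta}}}{\nu_1! \cdots \nu_h!} = 0. \tag{538}$$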


The first members of these equations represent the average 
values of the quantities in the principal parentheses. We 
have therefore 


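$$\overline{(\nu_1 - \bar\nu_1)^2} = \overline{\nu_1^2} - \bar\nu_1^2 = -\Theta \frac{d^2\Omega}{d\mu_1^2} = \Theta \frac{d\bar\nu_1}{d\mu_1}, \tag{539}$$

$$\overline{(\nu_1 - \bar\nu_1)(\nu_2 - \bar\nu_2)} = \overline{\nu_1\nu_2} - \bar\nu_1\bar\nu_2 = -\Theta \frac{d^2\Omega}{d\mu_1\, d\mu_2} = \Theta \frac{d\bar\nu_1}{d\mu_2} = \Theta \frac{d\bar\nu_2}{d\mu_1}. \tag{540}$$

From equation (539) we may get an idea of the order of
magnitude of the divergences of $\nu_1$ from its average value
in the ensemble, when that average value is great. The
equation may be written

$$\frac{\overline{(\nu_1 - \bar\nu_1)^2}}{\bar\nu_1^2} = \frac{\Theta}{\bar\nu_1^2} \frac{d\bar\nu_1}{d\mu_1}. \tag{541}$$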
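
The relation (539) between the mean square of the anomalies and the derivative of the average number may be tried numerically. The sketch assumes, as an illustration only, a single kind of non-interacting particle, so that the probabilities of the petit ensembles form a Poisson series with parameter $z e^{\mu/\Theta}$; `z` and the function name are hypothetical.

```python
import math

def moments(mu, theta, z, nu_max=2000):
    """Mean and variance of nu from the petit-ensemble probabilities (517)."""
    activity = z * math.exp(mu / theta)
    mean = 0.0
    second = 0.0
    for nu in range(nu_max + 1):
        p = math.exp(-activity + nu * math.log(activity) - math.lgamma(nu + 1))
        mean += nu * p
        second += nu * nu * p
    return mean, second - mean * mean

theta, mu, z, step = 1.0, 0.5, 100.0, 1e-4
mean, variance = moments(mu, theta, z)
# Second member of (539): Theta * d(nu_bar)/d(mu), by a central difference.
slope = (moments(mu + step, theta, z)[0] - moments(mu - step, theta, z)[0]) / (2 * step)
rhs_539 = theta * slope
relative = variance / mean ** 2   # first member of (541)
```

In this assumed case the relative mean square anomaly is simply the reciprocal of the average number, which illustrates why the second member of (541) is in general small when that number is great.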


The second member of this equation will in general be small
when $\bar\nu_1$ is great. Large values are not necessarily excluded,
but they must be confined within very small limits with
respect to $\mu$. For if

$$\frac{\overline{(\nu_1 - \bar\nu_1)^2}}{\bar\nu_1^2} > \frac{1}{\bar\nu_1^{1/2}} \tag{542}$$

for all values of $\mu_1$ between the limits $\mu_1'$ and $\mu_1''$, we shall
have between the same limits

$$\frac{\Theta}{\bar\nu_1^{3/2}}\, d\bar\nu_1 > d\mu_1, \tag{543}$$

and therefore

$$2\Theta \left( \frac{1}{\bar\nu_1'^{\,1/2}} - \frac{1}{\bar\nu_1''^{\,1/2}} \right) > \mu_1'' - \mu_1'. \tag{544}$$

The difference $\mu_1'' - \mu_1'$ is therefore numerically a very small
quantity. To form an idea of the importance of such a
difference, we should observe that in formula (498) $\mu_1$ is
multiplied by $\nu_1$ and the product subtracted from the energy.
A very small difference in the value of $\mu_1$ may therefore be
important. But since $\nu\Theta$ is always less than the kinetic energy
of the system, our formula shows that $\mu_1'' - \mu_1'$, even when
multiplied by $\bar\nu_1'$ or $\bar\nu_1''$, may still be regarded as an insensible
quantity.
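
The passage from (543) to (544) is an integration between the limits. Written out as a sketch, and assuming that the bound in (542) is $\bar\nu_1^{-1/2}$ and that $\bar\nu_1$ increases with $\mu_1$ (which (539) requires, the mean square anomaly being positive), the step is:

```latex
\mu_1'' - \mu_1' = \int_{\mu_1'}^{\mu_1''} d\mu_1
  < \Theta \int_{\bar\nu_1'}^{\bar\nu_1''} \bar\nu_1^{-3/2}\, d\bar\nu_1
  = 2\Theta \left( \bar\nu_1'^{\,-1/2} - \bar\nu_1''^{\,-1/2} \right)
  < \frac{2\Theta}{\bar\nu_1'^{\,1/2}},
\qquad\text{so that}\qquad
(\mu_1'' - \mu_1')\, \bar\nu_1' < 2\Theta\, \bar\nu_1'^{\,1/2} .
```

The last product is of the order of the square root of the number of particles multiplied by $\Theta$, and is therefore small in comparison with $\bar\nu_1' \Theta$.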

We can now perceive the leading characteristics with
respect to properties sensible to human faculties of such an
ensemble as we are considering (a grand ensemble canonically
distributed), when the average numbers of particles of the
various kinds are of the same order of magnitude as the number
of molecules in the bodies which are the subject of physical
experiment. Although the ensemble contains systems having
the widest possible variations in respect to the numbers of
the particles which they contain, these variations are
practically contained within such narrow limits as to be insensible,
except for particular values of the constants of the ensemble.
This exception corresponds precisely to the case of nature,
when certain thermodynamic quantities corresponding to $\Theta$,
$\mu_1$, $\mu_2$, etc., which in general determine the separate densities
of various components of a body, have certain values which
make these densities indeterminate, in other words, when the
conditions are such as determine coexistent phases of matter.
Except in the case of these particular values, the grand
ensemble would not differ to human faculties of perception from
a petit ensemble, viz., any one of the petit ensembles which it
contains in which $\nu_1$, $\nu_2$, etc., do not sensibly differ from their
average values.

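Let us now compare the quantities $\mathrm H$ and $\eta$, the average
values of which (in a grand and a petit ensemble respectively)
we have seen to correspond to entropy. Since

$$\mathrm H = \frac{\Omega + \mu_1\nu_1 + \cdots + \mu_h\nu_h - \epsilon}{\Theta},$$

and

$$\eta = \frac{\psi - \epsilon}{\Theta},$$

$$\mathrm H - \eta = \frac{\Omega + \mu_1\nu_1 + \cdots + \mu_h\nu_h - \psi}{\Theta}. \tag{545}$$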


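A part of this difference is due to the fact that $\mathrm H$ relates to
generic phases and $\eta$ to specific. If we write $\eta_{\text{gen}}$ for the
index of probability for generic phases in a petit ensemble,
we have

$$\eta_{\text{gen}} = \eta + \log(\nu_1! \cdots \nu_h!), \tag{546}$$

$$\mathrm H - \eta = \mathrm H - \eta_{\text{gen}} + \log(\nu_1! \cdots \nu_h!), \tag{547}$$

$$\mathrm H - \eta_{\text{gen}} = \frac{\Omega + \mu_1\nu_1 + \cdots + \mu_h\nu_h - \psi}{\Theta} - \log(\nu_1! \cdots \nu_h!). \tag{548}$$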

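This is the logarithm of the probability of the petit
ensemble $(\nu_1 \ldots \nu_h)$.* If we set

$$\frac{\psi_{\text{gen}} - \epsilon}{\Theta} = \eta_{\text{gen}}, \tag{549}$$

which corresponds to the equation

$$\frac{\psi - \epsilon}{\Theta} = \eta, \tag{550}$$

we have

$$\psi_{\text{gen}} = \psi + \Theta \log(\nu_1! \cdots \nu_h!),$$

and

$$\mathrm H - \eta_{\text{gen}} = \frac{\Omega + \mu_1\nu_1 + \cdots + \mu_h\nu_h - \psi_{\text{gen}}}{\Theta}. \tag{551}$$

This will have a maximum when *

$$\frac{d\psi_{\text{gen}}}{d\nu_1} = \mu_1, \qquad \frac{d\psi_{\text{gen}}}{d\nu_2} = \mu_2, \qquad \text{etc.} \tag{552}$$

* See formula (517).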


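Distinguishing values corresponding to this maximum by
accents, we have approximately, when $\nu_1', \ldots \nu_h'$ are of the
same order of magnitude as the numbers of molecules in
ordinary bodies,

$$\begin{aligned}
\mathrm H - \eta_{\text{gen}} &= \frac{\Omega + \mu_1\nu_1 + \cdots + \mu_h\nu_h - \psi_{\text{gen}}}{\Theta} \\
&= \frac{\Omega + \mu_1\nu_1' + \cdots + \mu_h\nu_h' - \psi_{\text{gen}}'}{\Theta}
 - \left( \frac{d^2\psi_{\text{gen}}}{d\nu_1^2} \right)' \frac{(\Delta\nu_1)^2}{2\Theta}
 - \left( \frac{d^2\psi_{\text{gen}}}{d\nu_1\, d\nu_2} \right)' \frac{\Delta\nu_1\, \Delta\nu_2}{\Theta}
 - \cdots
 - \left( \frac{d^2\psi_{\text{gen}}}{d\nu_h^2} \right)' \frac{(\Delta\nu_h)^2}{2\Theta},
\end{aligned} \tag{553}$$

$$e^{\mathrm H - \eta_{\text{gen}}} = e^{C}\, e^{-\left( \frac{d^2\psi_{\text{gen}}}{d\nu_1^2} \right)' \frac{(\Delta\nu_1)^2}{2\Theta} - \left( \frac{d^2\psi_{\text{gen}}}{d\nu_1\, d\nu_2} \right)' \frac{\Delta\nu_1\, \Delta\nu_2}{\Theta} - \cdots - \left( \frac{d^2\psi_{\text{gen}}}{d\nu_h^2} \right)' \frac{(\Delta\nu_h)^2}{2\Theta}}, \tag{554}$$

where

$$C = \frac{\Omega + \mu_1\nu_1' + \cdots + \mu_h\nu_h' - \psi_{\text{gen}}'}{\Theta}, \tag{555}$$

and

$$\Delta\nu_1 = \nu_1 - \nu_1', \qquad \Delta\nu_2 = \nu_2 - \nu_2', \qquad \text{etc.} \tag{556}$$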

This is the probability of the system $(\nu_1 \ldots \nu_h)$. The
probability that the values of $\nu_1, \ldots \nu_h$ lie within given limits is
given by the multiple integral

$$\int \cdots \int e^{C}\, e^{-\left( \frac{d^2\psi_{\text{gen}}}{d\nu_1^2} \right)' \frac{(\Delta\nu_1)^2}{2\Theta} - \left( \frac{d^2\psi_{\text{gen}}}{d\nu_1\, d\nu_2} \right)' \frac{\Delta\nu_1\, \Delta\nu_2}{\Theta} - \cdots - \left( \frac{d^2\psi_{\text{gen}}}{d\nu_h^2} \right)' \frac{(\Delta\nu_h)^2}{2\Theta}}\, d\nu_1 \ldots d\nu_h. \tag{557}$$

* Strictly speaking, $\psi_{\text{gen}}$ is not determined as function of $\nu_1, \ldots \nu_h$, except
for integral values of these variables. Yet we may suppose it to be
determined as a continuous function by any suitable process of interpolation.

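This shows that the distribution of the grand ensemble with
respect to the values of $\nu_1, \ldots \nu_h$ follows the "law of errors"
when $\nu_1', \ldots \nu_h'$ are very great. The value of this integral
for the limits $\pm\infty$ should be unity. This gives

$$e^{C}\, \frac{(2\pi\Theta)^{h/2}}{D^{1/2}} = 1, \tag{558}$$

or

$$C = \tfrac12 \log D - \frac{h}{2} \log(2\pi\Theta), \tag{559}$$

where

$$D = \begin{vmatrix}
\left( \dfrac{d^2\psi_{\text{gen}}}{d\nu_1^2} \right)' & \left( \dfrac{d^2\psi_{\text{gen}}}{d\nu_1\, d\nu_2} \right)' & \cdots & \left( \dfrac{d^2\psi_{\text{gen}}}{d\nu_1\, d\nu_h} \right)' \\
\left( \dfrac{d^2\psi_{\text{gen}}}{d\nu_2\, d\nu_1} \right)' & \left( \dfrac{d^2\psi_{\text{gen}}}{d\nu_2^2} \right)' & \cdots & \left( \dfrac{d^2\psi_{\text{gen}}}{d\nu_2\, d\nu_h} \right)' \\
\vdots & \vdots & & \vdots \\
\left( \dfrac{d^2\psi_{\text{gen}}}{d\nu_h\, d\nu_1} \right)' & \left( \dfrac{d^2\psi_{\text{gen}}}{d\nu_h\, d\nu_2} \right)' & \cdots & \left( \dfrac{d^2\psi_{\text{gen}}}{d\nu_h^2} \right)'
\end{vmatrix}; \tag{560}$$

that is,

$$D = \begin{vmatrix}
\left( \dfrac{d\mu_1}{d\nu_1} \right)' & \left( \dfrac{d\mu_1}{d\nu_2} \right)' & \cdots & \left( \dfrac{d\mu_1}{d\nu_h} \right)' \\
\vdots & \vdots & & \vdots \\
\left( \dfrac{d\mu_h}{d\nu_1} \right)' & \left( \dfrac{d\mu_h}{d\nu_2} \right)' & \cdots & \left( \dfrac{d\mu_h}{d\nu_h} \right)'
\end{vmatrix}. \tag{561}$$

Now, by (553), we have for the first approximation

$$\bar{\mathrm H} - \bar\eta_{\text{gen}} = C = \tfrac12 \log D - \frac{h}{2} \log(2\pi\Theta), \tag{562}$$

and if we divide by the constant $K$,* to reduce these quantities
to the usual unit of entropy,

$$\frac{\bar{\mathrm H} - \bar\eta_{\text{gen}}}{K} = \frac{\log D - h \log(2\pi\Theta)}{2K}. \tag{563}$$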


* See pages 184-186.

This is evidently a negligible quantity, since $K$ is of the same
order of magnitude as the number of molecules in ordinary
bodies. It is to be observed that $\bar\eta_{\text{gen}}$ is here the average in
the grand ensemble, whereas the quantity which we wish to
compare with $\bar{\mathrm H}$ is the average in a petit ensemble. But as we
have seen that in the case considered the grand ensemble would
appear to human observation as a petit ensemble, this
distinction may be neglected.
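
The smallness of the quantity in (562) may be seen in a numerical sketch. The case assumed here (one kind of non-interacting particle, $h = 1$) is an illustration and not part of the text. In it $\psi_{\text{gen}} = \psi + \Theta \log \nu!$ with $\psi$ linear in $\nu$, so that by Stirling's approximation $D = \Theta/\nu'$ and $C = -\tfrac12 \log(2\pi\nu')$, which may be compared with the exact logarithm of the probability of the most probable petit ensemble.

```python
import math

def log_prob_at_mode(nu_mode):
    """Exact log-probability of the most probable petit ensemble (Poisson case)."""
    # log of e^{-a} a^nu / nu!  with a = nu = nu_mode
    return -nu_mode + nu_mode * math.log(nu_mode) - math.lgamma(nu_mode + 1)

def c_from_562(nu_mode, theta=1.0):
    """C = (1/2) log D - (h/2) log(2 pi Theta), with h = 1 and D = Theta / nu_mode."""
    d = theta / nu_mode   # second derivative of psi_gen at the maximum (Stirling)
    return 0.5 * math.log(d) - 0.5 * math.log(2.0 * math.pi * theta)

rows = []
for nu_mode in (1e2, 1e4, 1e6):
    # Both columns grow only as log(nu_mode), hence are negligible beside nu_mode.
    rows.append((nu_mode, log_prob_at_mode(nu_mode), c_from_562(nu_mode)))
```

Each entry of `rows` holds the number of particles, the exact value, and the approximation; the two agree more closely as the number increases.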

The differences therefore, in the case considered, between the
quantities which may be represented by the notations *

$$\bar{\mathrm H}_{\text{gen}}\big|_{\text{grand}}, \qquad \bar\eta_{\text{gen}}\big|_{\text{grand}}, \qquad \bar\eta_{\text{gen}}\big|_{\text{petit}}$$

are not sensible to human faculties. The difference

$$\bar\eta_{\text{gen}}\big|_{\text{petit}} - \bar\eta_{\text{spec}}\big|_{\text{petit}} = \log(\nu_1! \cdots \nu_h!),$$

and is therefore constant, so long as the numbers $\nu_1, \ldots \nu_h$
are constant. For constant values of these numbers, therefore,
it is immaterial whether we use the average of $\eta_{\text{gen}}$ or of $\eta$ for
entropy, since this only affects the arbitrary constant of
integration which is added to entropy. But when the numbers
$\nu_1, \ldots \nu_h$ are varied, it is no longer possible to use the index
for specific phases. For the principle that the entropy of any
body has an arbitrary additive constant is subject to
limitation, when different quantities of the same substance are
concerned. In this case, the constant being determined for
one quantity of a substance, is thereby determined for all
quantities of the same substance.

* In the preceding paragraph, for greater distinctness, $\bar{\mathrm H}_{\text{gen}}\big|_{\text{grand}}$ and
$\bar\eta_{\text{spec}}\big|_{\text{petit}}$ have been written for the quantities which elsewhere
are denoted by $\bar{\mathrm H}$ and $\bar\eta$.

To fix our ideas, let us suppose that we have two identical
fluid masses in contiguous chambers. The entropy of the
whole is equal to the sum of the entropies of the parts, and
double that of one part. Suppose a valve is now opened,
making a communication between the chambers. We do not
regard this as making any change in the entropy, although
the masses of gas or liquid diffuse into one another, and
although the same process of diffusion would increase the
entropy, if the masses of fluid were different. It is evident,
therefore, that it is equilibrium with respect to generic phases,
and not with respect to specific, with which we have to do in
the evaluation of entropy, and therefore, that we must use
the average of $\mathrm H$ or of $\eta_{\text{gen}}$, and not that of $\eta$, as the
equivalent of entropy, except in the thermodynamics of bodies in
which the number of molecules of the various kinds is
constant.
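
The case of the two identical fluid masses may be put in numbers. The sketch assumes, as an illustration only, an ideal gas for which the average index for specific phases is $\bar\eta = -\nu \log(Vc) - \tfrac32\nu$, the constant `c` standing for $(2\pi m\Theta)^{3/2}$; the function names and sample values are hypothetical. It compares the quantity $-\bar\eta$ for the united mass with the sum for the two separate masses, first for specific and then for generic phases.

```python
import math

def minus_eta_specific(nu, volume, c=1.0):
    """-(average index) for specific phases of the assumed ideal gas."""
    return nu * math.log(volume * c) + 1.5 * nu

def minus_eta_generic(nu, volume, c=1.0):
    """Same for generic phases, using eta_gen = eta + log(nu!) as in (546)."""
    return minus_eta_specific(nu, volume, c) - math.lgamma(nu + 1)

nu, volume = 1.0e6, 1.0e6   # hypothetical sample values

# Excess of the whole (2 nu particles in 2 V) over twice one part.
excess_specific = minus_eta_specific(2 * nu, 2 * volume) - 2 * minus_eta_specific(nu, volume)
excess_generic = minus_eta_generic(2 * nu, 2 * volume) - 2 * minus_eta_generic(nu, volume)
```

With specific phases the excess is $2\nu \log 2$, of the same order as the entropy itself; with generic phases it is about $\tfrac12 \log(\pi\nu)$, which is insensible. This is the numerical content of the statement that the index for generic phases must be used when the numbers of particles vary.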
PREFATORY NOTE TO PART TWO


In a few cases slight corrections had been made by the author
in his own copies of the papers. These changes, together with the
correction of obvious misprints in the originals, have been
incorporated in the present edition without comment. Where for the
sake of clearness it has seemed desirable to insert a word or two
in a footnote or in the text itself, the addition has been indicated
by enclosing it within square brackets [ ], a sign which is otherwise
used only in the formulae.

I.


ON THE FUNDAMENTAL FORMULAE OF DYNAMICS.


[American Journal of Mathematics, vol. ii. pp. 49-64, 1879.]


Formation of a new Indeterminate Formula of Motion by the
Substitution of the Variations of the Components of Acceleration for
the Variations of the Coordinates in the usual Formula.

The laws of motion are frequently expressed by an equation of the
form

$$\sum [(X - m\ddot x)\,\delta x + (Y - m\ddot y)\,\delta y + (Z - m\ddot z)\,\delta z] = 0, \tag{1}$$

in which

$m$ denotes the mass of a particle of the system considered,

$x, y, z$ its rectangular coordinates,

$\ddot x, \ddot y, \ddot z$ the second differential coefficients of the coordinates with
respect to the time,

$X, Y, Z$ the components of the forces acting on the particle,

$\delta x, \delta y, \delta z$ any arbitrary variations of the coordinates which are
simultaneously possible, and

$\sum$ a summation with respect to all the particles of the system.

It is evident that we may substitute for $\delta x, \delta y, \delta z$ any other
expressions which are capable of the same and only of the same sets
of simultaneous values.

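Now if the nature of the system is such that certain functions
$A$, $B$, etc. of the coordinates must be constant, or given functions of
the time, we have

$$\sum \left( \frac{dA}{dx}\,\delta x + \frac{dA}{dy}\,\delta y + \frac{dA}{dz}\,\delta z \right) = 0, \qquad
\sum \left( \frac{dB}{dx}\,\delta x + \frac{dB}{dy}\,\delta y + \frac{dB}{dz}\,\delta z \right) = 0, \qquad \text{etc.} \tag{2}$$

These are the equations of condition, to which the variations in
the general equation of motion (1) are subject. But if $A$ is constant
or a determined function of the time, the same must be true of
$\dot A$ and $\ddot A$. Now

$$\dot A = \sum \left( \frac{dA}{dx}\,\dot x + \frac{dA}{dy}\,\dot y + \frac{dA}{dz}\,\dot z \right)$$

and

$$\ddot A = \sum \left( \frac{dA}{dx}\,\ddot x + \frac{dA}{dy}\,\ddot y + \frac{dA}{dz}\,\ddot z \right) + H,$$

where $H$ represents terms containing only the second differential
coefficients of $A$ with respect to the coordinates, and the first
differential coefficients of the coordinates with respect to the time.
Therefore, if we conceive of a variation affecting the accelerations of the
particles at the time considered, but not their positions or velocities,
we have

$$\delta \ddot A = \sum \left( \frac{dA}{dx}\,\delta\ddot x + \frac{dA}{dy}\,\delta\ddot y + \frac{dA}{dz}\,\delta\ddot z \right) = 0,$$

and, in like manner,

$$\delta \ddot B = \sum \left( \frac{dB}{dx}\,\delta\ddot x + \frac{dB}{dy}\,\delta\ddot y + \frac{dB}{dz}\,\delta\ddot z \right) = 0, \qquad \text{etc.} \tag{3}$$

Comparing these equations with (2), we see that when the
accelerations of the particles are regarded as subject to the variation denoted
by $\delta$, but not their positions or velocities, the possible values of $\delta\ddot x$, $\delta\ddot y$,
$\delta\ddot z$ are subject to precisely the same restrictions as the values of $\delta x$,
$\delta y$, $\delta z$, when the positions of the particles are regarded as variable.
We may, therefore, write for the general equation of motion

$$\sum [(X - m\ddot x)\,\delta\ddot x + (Y - m\ddot y)\,\delta\ddot y + (Z - m\ddot z)\,\delta\ddot z] = 0, \tag{4}$$

regarding the positions and velocities of the particles as unaffected by
the variation denoted by $\delta$, a condition which may be expressed by
the equations

$$\delta x = 0, \quad \delta y = 0, \quad \delta z = 0, \qquad \delta\dot x = 0, \quad \delta\dot y = 0, \quad \delta\dot z = 0. \tag{5}$$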
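
A short worked case may make the passage from (2) to (3) more distinct. As an illustration not found in the text, suppose a single point constrained to a circle in the plane, so that $A = x^2 + y^2$ is constant:

```latex
\begin{aligned}
A &= x^2 + y^2, \qquad
  x\,\delta x + y\,\delta y = 0 \quad\text{(equation of condition, as in (2))},\\
\dot A &= 2\,(x\dot x + y\dot y), \qquad
\ddot A = 2\,(x\ddot x + y\ddot y) + H, \qquad H = 2\,(\dot x^2 + \dot y^2),\\
\delta\ddot A &= 2\,(x\,\delta\ddot x + y\,\delta\ddot y) = 0
  \quad\text{(since } \delta x = \delta y = \delta\dot x = \delta\dot y = 0 \text{, so that } \delta H = 0\text{)}.
\end{aligned}
```

The variations $\delta\ddot x$, $\delta\ddot y$ are thus restricted exactly as $\delta x$, $\delta y$ are, and formula (4) requires that the vector $(X - m\ddot x,\ Y - m\ddot y)$ be directed along the radius $(x, y)$, as the ordinary treatment also gives.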


We have so far supposed that the conditions which restrict the
possible motions of the systems may be expressed by equations
between the coordinates alone or the coordinates and the time. To
extend the formula of motion to cases in which the conditions are
expressed by the characters $\leq$ or $\geq$, we may write

$$\sum [(X - m\ddot x)\,\delta\ddot x + (Y - m\ddot y)\,\delta\ddot y + (Z - m\ddot z)\,\delta\ddot z] \leq 0. \tag{6}$$

The conditions which determine the possible values of $\delta\ddot x$, $\delta\ddot y$, $\delta\ddot z$
will not, in such cases, be entirely similar to those which determine
the possible values of $\delta x$, $\delta y$, $\delta z$, when the coordinates are regarded as
variable. Nevertheless, the laws of motion are correctly expressed
by the formula (6), while the formula

$$\sum [(X - m\ddot x)\,\delta x + (Y - m\ddot y)\,\delta y + (Z - m\ddot z)\,\delta z] \leq 0, \tag{7}$$

does not, as naturally interpreted, give so complete and accurate an
expression of the laws of motion.

This may be illustrated by a simple example. 

Let it be required to find the acceleration of a material point,
which, at a given instant, is moving with given velocity on the
frictionless surface of a body (which it cannot penetrate, but which
it may leave), and is acted on by given forces. For simplicity, we
may suppose that the normal to the surface, drawn outward from the
moving point at the moment considered, is parallel to the axis of $X$
and in the positive direction. The only restriction on the values of
$\delta x$, $\delta y$, $\delta z$ is that

$$\delta x \geq 0.$$

Formula (7) will therefore give

$$\ddot x \geq \frac{X}{m}, \qquad \ddot y = \frac{Y}{m}, \qquad \ddot z = \frac{Z}{m}.$$

The condition that the point shall not penetrate the body gives
another condition for the value of $\ddot x$. If the point remains upon the
surface, $\ddot x$ must have a certain value $N$, determined by the form of
the surface and the velocity of the point. If the value of $\ddot x$ is less
than this, the point must penetrate the body. Therefore,

$$\ddot x \geq N.$$

But this does not suffice to determine the acceleration of the point.

Let us now apply formula (6) to the same problem. Since $\ddot x$ cannot
be less than $N$,

$$\text{if } \ddot x = N, \qquad \delta\ddot x \geq 0.$$

This is the only restriction on the value of $\delta\ddot x$, for if $\ddot x > N$, the
value of $\delta\ddot x$ is entirely arbitrary. Formula (6), therefore, requires
that

$$\text{if } \ddot x = N, \quad \ddot x \geq \frac{X}{m}; \qquad \text{but if } \ddot x > N, \quad \ddot x = \frac{X}{m}:$$

that is (since $\ddot x$ cannot be less than $N$), that $\ddot x$ shall be equal to the
greater of the quantities $N$ and $X/m$, or to both, if they are equal;
and that

$$\ddot y = \frac{Y}{m}, \qquad \ddot z = \frac{Z}{m}.$$


The values of $\ddot x$, $\ddot y$, $\ddot z$ are therefore entirely determined by this
formula in connection with the conditions afforded by the constraints
of the system.*

* The failure of the formula (7) in this case is rather apparent than real; for,
although the formula apparently allows to $\ddot x$, at the instant considered, a value
exceeding both $N$ and $X/m$, it does not allow this for any interval, however short. For
if $\ddot x > N$, the point will immediately leave the surface, and then the formula requires
[that $\ddot x = X/m$].

The following considerations will show that what is true in this
case is also true in general, when the conditions to which the system
is subject are such that certain functions of the coordinates cannot
exceed certain limits, either constant or variable with the time. If
certain values of $\delta\ddot x$, $\delta\ddot y$, $\delta\ddot z$ (with unvaried values of $x$, $y$, $z$, and $\dot x$, $\dot y$, $\dot z$)
are simultaneously possible at a given instant, equal or proportional
values with the same signs must be possible for $\delta x$, $\delta y$, $\delta z$ immediately
after the instant considered, and must satisfy formula (1), and
therefore (6), in connection with the values of $\ddot x$, $\ddot y$, $\ddot z$, $X$, $Y$, $Z$ immediately
after that instant. The values of $\ddot x$, $\ddot y$, $\ddot z$, thus determined, are of
course the very quantities which we wish to obtain, since the
acceleration of a point at a given instant does not denote anything different
from its acceleration immediately after that instant.
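
The example of the point on the frictionless surface may be sketched in a few lines. The function names and the sample numbers are hypothetical; the rule coded is the one stated for formula (6), that the acceleration along the normal is the greater of $N$ and $X/m$, while the other components are those of a free point.

```python
def acceleration_on_surface(force, mass, n_required):
    """Acceleration of a point resting on a frictionless, impenetrable surface.

    force      -- (X, Y, Z), with the outward normal along +x
    n_required -- N, the x-acceleration that keeps the point on the surface
    """
    fx, fy, fz = force
    free_x = fx / mass
    # Formula (6): the x-acceleration is the greater of N and X/m.
    ax = max(n_required, free_x)
    return ax, fy / mass, fz / mass

def satisfies_formula_6(ax, force_x, mass, n_required, tol=1e-12):
    """Check (X - m*ax) * d_ax <= 0 for every permitted variation d_ax."""
    if ax < n_required - tol:
        return False                      # the point would penetrate the body
    residual = force_x - mass * ax
    if ax > n_required + tol:
        return abs(residual) <= tol       # d_ax arbitrary: residual must vanish
    return residual <= tol                # d_ax >= 0 only: residual must be <= 0

pressed = acceleration_on_surface(force=(-3.0, 1.0, 0.0), mass=2.0, n_required=0.5)
leaving = acceleration_on_surface(force=(4.0, 1.0, 0.0), mass=2.0, n_required=0.5)
```

In the first call the force presses the point against the surface and the acceleration along the normal is $N$; in the second the point leaves the surface with the acceleration $X/m$. A value exceeding both, which formula (7) would seem to allow at the instant considered, is rejected by `satisfies_formula_6`.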

For an example of a somewhat different class of cases, we may
suppose that in a system, otherwise free, $\dot x$ cannot have a negative
value. Such a condition does not seem to affect the possible values
of $\delta x$, as naturally interpreted in a dynamical problem. Yet, if we
should regard the value of $\delta x$ in (7) as arbitrary, we should obtain

$$\ddot x = \frac{X}{m},$$

which might be erroneous. But if we regard $\delta x$ as expressing a
velocity of which the system, if at rest, would be capable (which is
not a natural signification of the expression), we should have $\delta x \geq 0$,
which, with (7), gives

$$\ddot x \geq \frac{X}{m}.$$

This is not incorrect, but it leaves the acceleration undetermined.
If we should regard $\delta x$ as denoting such a variation of the velocity
as is possible for the system when it has its given velocity (this also
is not a natural signification of the expression), formula (7) would
give the correct value of $\ddot x$ except when $\dot x = 0$. In this case (which
cannot be regarded as exceptional in a problem of this kind), we
should have $\delta x \geq 0$, which will leave $\ddot x$ undetermined, as before.

The application of formula (6), in problems of this kind, presents
no difficulty. From the condition

$$\dot x \geq 0$$

we obtain, first, if $\dot x = 0$, $\ddot x \geq 0$,

then, if $\dot x = 0$ and $\ddot x = 0$, $\delta\ddot x \geq 0$,

which is the only limitation on the value of $\delta\ddot x$. With this condition,
we deduce from (6) that either

$$\dot x = 0, \qquad \ddot x = 0, \qquad \text{and} \qquad \ddot x \geq \frac{X}{m};$$

or

$$\ddot x = \frac{X}{m}.$$

That is, if $\dot x = 0$, $\ddot x$ has the greater of the values $X/m$ and 0; otherwise,
$\ddot x = X/m$.

In cases of this kind also, in which the function which cannot 
exceed a certain value involves the velocities (with or without the 
coordinates), one may easily convince himself that formula (6) is 
always valid, and always sufficient to determine the accelerations 
with the aid of the conditions afforded by the constraints of the 
system. 
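
The second example, in which the velocity along $x$ cannot be negative, admits an equally short sketch. The function name is hypothetical, and a negative given velocity is treated as an inadmissible datum; the rule coded is the one just deduced from formula (6).

```python
def acceleration_with_nonnegative_velocity(force_x, mass, velocity_x):
    """x-acceleration when the velocity along x cannot have a negative value."""
    if velocity_x < 0.0:
        raise ValueError("the given velocity violates the condition")
    free = force_x / mass
    if velocity_x > 0.0:
        return free            # condition inactive: the variation is arbitrary
    # velocity_x == 0: the acceleration cannot be negative,
    # so it is the greater of X/m and 0.
    return max(free, 0.0)

held = acceleration_with_nonnegative_velocity(force_x=-4.0, mass=2.0, velocity_x=0.0)
released = acceleration_with_nonnegative_velocity(force_x=4.0, mass=2.0, velocity_x=0.0)
moving = acceleration_with_nonnegative_velocity(force_x=-4.0, mass=2.0, velocity_x=1.0)
```

The three calls correspond to a system held at rest by the condition, one which the force sets in motion, and one already moving, for which the condition has no present effect.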

But instead of examining such cases in detail, we shall proceed to
consider the subject from a more general point of view.


Comparison of the New Formula with the Statical Principle of
Virtual Velocities. Case of Discontinuous Changes of Velocity.

Formula (1) has so far served as a point of departure. The general
validity of this, the received form of the indeterminate equation of
motion, being assumed, it has been shown that formula (6) will be
valid and sufficient, even in cases in which both (1) and (7) fail. We
now proceed to show that the statical principle of virtual velocities,
when its real signification is carefully considered, leads directly to
formula (6), or to an analogous formula for the determination of the
discontinuous changes of velocity, when such occur. This will be the
case even if we start with the usual analytical expression of the
principle

$$\sum (X\,\delta x + Y\,\delta y + Z\,\delta z) \leq 0, \tag{8}$$

to which, at first sight, formula (6) appears less closely related than
(7). For the variations of the coordinates in this formula must be
regarded as relating to differences between the configuration which
the system has at a certain time, and which it will continue to have
in case of equilibrium, and some other configuration which the system
might be supposed to have at some subsequent time. These temporal
relations are not indicated explicitly in the notation, and should not
be, since the statical problem does not involve the time in any
quantitative manner. But in a dynamical problem, in which we take
account of the time, it is hardly natural to use $\delta x$, $\delta y$, $\delta z$ in the same
sense. In any problem in which $x$, $y$, $z$ are regarded as functions
of the time, $\delta x$, $\delta y$, $\delta z$ are naturally understood to relate to differences
between the configuration which the system has at a certain time,
and some other configuration which it might (conceivably) have had
at that time instead of that which it actually had.

Now when we suppose a point to have a certain position, specified
by $x$, $y$, $z$, at a certain time, its position at that time is no longer a
subject of hypothesis or of question. It is its future positions which
form the subject of inquiry. Its position in the immediate future
is naturally specified by

$$x + \dot x\, dt + \tfrac12 \ddot x\, dt^2 + \text{etc.}, \qquad y + \dot y\, dt + \tfrac12 \ddot y\, dt^2 + \text{etc.}, \qquad z + \dot z\, dt + \tfrac12 \ddot z\, dt^2 + \text{etc.},$$

and we may regard the variations of these expressions as
corresponding to the $\delta x$, $\delta y$, $\delta z$ of the statical problem. It is evidently
sufficient to take account of the first term of these expressions of
which the variation is not zero. Now, $x$, $y$, $z$, as has already been
said, are to be regarded as constant. With respect to the terms
containing $\dot x$, $\dot y$, $\dot z$, two cases are to be distinguished, according as
there is, or is not, a finite change of velocity at the instant considered.

Let us first consider the most important case, in which there is no
discontinuous change of velocity. In this case, $\dot x$, $\dot y$, $\dot z$ are not to be
regarded as variable (by $\delta$), and the variations of the above
expressions are represented by

$$\tfrac12\, \delta\ddot x\, dt^2, \qquad \tfrac12\, \delta\ddot y\, dt^2, \qquad \tfrac12\, \delta\ddot z\, dt^2,$$

which are, therefore, to be substituted for $\delta x$, $\delta y$, $\delta z$ in the general
formula of equilibrium (8) to adapt it to the conditions of a dynamical
problem. By this substitution (in which the common factor $\tfrac12 dt^2$
may of course be omitted), and the addition of the terms expressing
the reaction against acceleration, we obtain formula (6).

But if the circumstances are such that there is (or may be) a
discontinuity in the values of $\dot x$, $\dot y$, $\dot z$ at the instant considered, it is
necessary to distinguish the values of these expressions before and
after the abrupt change. For this purpose, we may apply $\dot x$, $\dot y$, $\dot z$
to the original values, and denote the changed values by $\dot x + \Delta\dot x$,
$\dot y + \Delta\dot y$, $\dot z + \Delta\dot z$. The value of $x$ at a time very shortly subsequent
to the instant considered, will be expressed by $x + (\dot x + \Delta\dot x)\,dt + \text{etc.}$, in
which we may regard $\Delta\dot x$ as subject to the variation denoted by $\delta$.
The variation of the expression is therefore $\delta\Delta\dot x\, dt$. Instead of $-m\ddot x$,
which expresses the reaction against acceleration, we need in the
present case $-m\Delta\dot x$ to express the reaction against the abrupt change
of velocity. A reaction against such a change of velocity is, of
course, to be regarded as infinite in intensity in comparison with
reactions due to acceleration, and ordinary forces (such as cause
acceleration) may be neglected in comparison. If, however, we
conceive of the system as acted on by impulsive forces (i.e., such
as have no finite duration, but are capable of producing finite changes
of velocity, and are measured numerically by the discontinuities of
velocity which they produce in the unit of mass), these forces should
be combined with the reactions due to the discontinuities of velocity
in the general formula which determines these discontinuities. If
the impulsive forces are specified by $X$, $Y$, $Z$, the formula will be

$$\sum [(X - m\Delta\dot x)\,\delta\Delta\dot x + (Y - m\Delta\dot y)\,\delta\Delta\dot y + (Z - m\Delta\dot z)\,\delta\Delta\dot z] \leq 0. \tag{9}$$

The reader will remark the strict analogy between this formula
and (6), which would perhaps be more clearly exhibited if we should
write $\dfrac{d\dot x}{dt}$, $\dfrac{d\dot y}{dt}$, $\dfrac{d\dot z}{dt}$ for $\ddot x$, $\ddot y$, $\ddot z$ in that formula.
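
Formula (9) may be tried on a case which the text does not work out, and which is therefore offered only as a sketch by analogy with the treatment of (6): a point whose velocity along $x$, after the abrupt change, cannot be negative (as when it strikes a wall without rebound). Reasoning as before, if $\dot x + \Delta\dot x = 0$ the variation $\delta\Delta\dot x$ cannot be negative and (9) gives $\Delta\dot x \geq X/m$; otherwise $\Delta\dot x = X/m$. The function name is hypothetical.

```python
def velocity_jump(impulse_x, mass, velocity_x):
    """Abrupt change of x-velocity when the velocity after the change must be >= 0.

    impulse_x  -- impulsive force X (zero when none acts)
    velocity_x -- velocity immediately before the change
    """
    free_jump = impulse_x / mass          # change which a free point would receive
    stopping_jump = -velocity_x           # change which just brings the point to rest
    # By analogy with formula (6): the greater of the two values.
    return max(free_jump, stopping_jump)

arrest = velocity_jump(impulse_x=0.0, mass=2.0, velocity_x=-3.0)     # point meets the wall
kicked = velocity_jump(impulse_x=10.0, mass=2.0, velocity_x=-3.0)    # impulse exceeds what is needed
untouched = velocity_jump(impulse_x=0.0, mass=2.0, velocity_x=1.0)   # condition already satisfied
```

The first case gives the change which destroys the velocity toward the wall, the second the change due to the impulsive force alone, and the third no change at all.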

But these formulae may be established in a much more direct 
manner. For the formula (8), although for many purposes the most 
convenient expression of the principle of virtual velocities, is by no 
means the most convenient for our present purpose. As the usual 
name of the principle implies, it holds true of velocities as well 
as of displacements, and is perhaps more simple and more evident 
when thus applied.* 

If we wish to apply the principle, thus understood, to a moving
system so as to determine whether certain changes of velocity
specified by $\Delta\dot x$, $\Delta\dot y$, $\Delta\dot z$ are those which the system will really receive
at a given instant, the velocities to be multiplied into the forces and
reactions in the most simple application of the principle are manifestly
such as may be imagined to be compounded with the assumed
velocities, and are therefore properly specified by $\delta\Delta\dot x$, $\delta\Delta\dot y$, $\delta\Delta\dot z$.
The formula (9) may therefore be regarded as the most direct
application of the principle of virtual velocities to discontinuous changes
of velocity in a moving system.

In the case of a system in which there are no discontinuous changes
of velocity, but which is subject to forces tending to produce
accelerations, when we wish to determine whether certain accelerations,
specified by $\ddot x$, $\ddot y$, $\ddot z$, are such as the system will really receive, it is
evidently necessary to consider whether any possible variation of
these accelerations is favored more than it is opposed by the forces
and reactions of the system. The formula (6) expresses a criterion
of this kind in the most simple and direct manner. If we regard
a force as a tendency to increase a quantity expressed by $\ddot x$, the
product of the force by $\delta\ddot x$ is the natural measure of the extent to
which this tendency is satisfied by an arbitrary variation of the
accelerations. The principle expressed by the formula may not be
very accurately designated by the words virtual velocities, but it
certainly does not differ from the principle of virtual velocities (in
the stricter sense of the term), more than this differs from that of
virtual displacements, a difference so slight that the distinction of
the names is rarely insisted upon, and that it is often very difficult
to tell which form of the principle is especially intended, even when
the principle is enunciated or discussed somewhat at length.


* Even in Statics, the principle of virtual velocities, as distinguished from that of
virtual displacements, has a certain advantage in respect of its evidence. The
demonstration of the principle in the first section of the Mécanique Analytique, if velocities
had been considered instead of displacements, would not have been exposed to an
objection, which has been expressed by M. Bertrand in the following words: "On a
objecté, avec raison, à cette assertion de Lagrange l'exemple d'un point pesant en
équilibre au sommet le plus élevé d'une courbe; il est évident qu'un déplacement
infiniment petit le ferait descendre, et, pourtant, ce déplacement ne se produit pas."
[It has been objected, with reason, to this assertion of Lagrange, that a heavy point
may be in equilibrium at the highest point of a curve; it is evident that an infinitely
small displacement would make it descend, and yet this displacement does not take place.]
(Mécanique Analytique, troisième édition, tome 1, page 22, note de M. Bertrand.) The
value of $z$ (the height of the point above a horizontal plane) can certainly be diminished
by a displacement of the point, but the value of $z$ is not affected by any velocity given
to the point.

The real difficulty in the consideration of displacements is that they are only possible
at a time subsequent to that in which the system has the configuration to which the
question of equilibrium relates. We may make the interval of time infinitely short,
but it will always be difficult, in the establishing of fundamental principles, to treat a
conception of this kind (relating to what is possible after an infinitesimal interval of
time) with the same rigor as the idea of velocities or accelerations, which, in the cases
to which (9) and (6) respectively relate, we may regard as communicated immediately to
the system.

But, although the formulae (6) and (9) differ so little from the
ordinary formulae, they not only have a marked advantage in respect
of precision and accuracy, but also may be more satisfactory to the
mind, in that the changes considered (to which $\delta$ relates), are not so
violently opposed to all the possibilities of the case as are those which
are represented by the variations of the coordinates.* Moreover, as
we shall see, they naturally lead to various important laws of motion.

Transformation of the New Formula.

Let us now consider some of the transformations of which our
general formula (6) is capable. If we separate the terms containing
the masses of the particles from those which contain the forces, we
have

$$\sum (X\,\delta\ddot x + Y\,\delta\ddot y + Z\,\delta\ddot z) - \sum [\tfrac12 m\, \delta(\ddot x^2 + \ddot y^2 + \ddot z^2)] \leq 0, \tag{10}$$

or, if we write $u$ for the acceleration of a particle,

$$\sum (X\,\delta\ddot x + Y\,\delta\ddot y + Z\,\delta\ddot z) - \sum [\tfrac12 m\, \delta(u^2)] \leq 0. \tag{11}$$

* It may have seemed to some readers of the Mécanique Analytique (a work of which
the unity of method is one of the most striking characteristics, and that to which its
universally recognized artistic merit is in great measure due) that the treatment of
dynamical problems in that work is not entirely analogous to the treatment of statical
problems. The statical question, whether a system will remain in equilibrium in a
given configuration, is determined by Lagrange by considering all possible motions of
the system and inquiring whether there is any reason why the system should take any
one of them. A similar method in dynamics would be based upon a comparison of a
proposed motion with all other motions of which the system is capable without violating
its kinematical conditions. Instead of this, Lagrange virtually reduces the dynamical
problem to a statical one, and considers, not the possible variations of the proposed
motion, but the motions which would be possible if the system were at rest. This
reduction of a given problem to a simpler one, which has already been solved, is a
method which has its advantages, but it is not the characteristic method of the
Mécanique Analytique. That which most distinguishes the plan of this treatise from
the usual type is the direct application of the general principle to each particular case.

The point is perhaps of small moment, and may be differently regarded by others,
but it is mentioned here because it was a feeling of this kind (whether justified or not)
and the desire to express the formula of motion by means of a maximum or minimum
condition, in which the conditions under which the maximum or minimum subsists
should be such as the problem naturally affords (Gauss's principle of least constraint
being at the time unknown to the present writer, and the conditions under which the
minimum subsists in the principle of least action being such that that is hardly
satisfactory as a fundamental principle), which led to the formulae proposed in this paper.

If, instead of terms of the form $X\,\delta x$, or in addition to such terms, 
equation (1) had contained terms of the form $P\,\delta p$, in which $p$ denotes 
any quantity determined by the configuration of the system, it is 
evident that these would give terms of the form $P\,\delta\ddot{p}$ in (6), (10) and 
(11). For the considerations which justified the substitution of 
$\delta\ddot{x}$, $\delta\ddot{y}$, $\delta\ddot{z}$ for $\delta x$, $\delta y$, $\delta z$ in the usual formula were in no respect dependent 
upon the fact that $x$, $y$, $z$ denote rectangular coordinates, but would 
apply equally to any other quantities which are determined by the 
configuration of the system. 

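Hence, if the moments of all the forces of the system are represented 
by the sum 

$$\sum (P\,dp),$$

the general formula of motion may be written 

$$\sum (P\,\delta\ddot{p}) - \delta\sum\left(\tfrac{1}{2} m u^2\right) \leq 0. \tag{12}$$

If the forces admit of a force-function $V$, we have 

$$\delta\ddot{V} - \delta\sum\left(\tfrac{1}{2} m u^2\right) \leq 0,$$

or 

$$\delta\left[\ddot{V} - \sum\left(\tfrac{1}{2} m u^2\right)\right] \leq 0. \tag{13}$$

But if the forces are determined in any way whatever by the 
configuration and velocities of the system, with or without the time, 
$X$, $Y$, $Z$ and $P$ will be unaffected by the variation denoted by $\delta$, and 
we may write the formula of motion in the form 

$$\delta\sum\left(X\ddot{x} + Y\ddot{y} + Z\ddot{z} - \tfrac{1}{2} m u^2\right) \leq 0, \tag{14}$$

or 

$$\delta\left[\sum (P\ddot{p}) - \sum\left(\tfrac{1}{2} m u^2\right)\right] \leq 0. \tag{15}$$

If the forces are determined by the configuration alone, or the 
configuration and the time, $\delta\dot{X} = 0$, $\delta\dot{Y} = 0$, $\delta\dot{Z} = 0$, $\delta\dot{P} = 0$, and the 
general formula may be written 

$$\delta\left[\frac{d}{dt}\sum (X\dot{x} + Y\dot{y} + Z\dot{z}) - \sum\left(\tfrac{1}{2} m u^2\right)\right] \leq 0, \tag{16}$$

or 

$$\delta\left[\frac{d}{dt}\sum (P\dot{p}) - \sum\left(\tfrac{1}{2} m u^2\right)\right] \leq 0. \tag{17}$$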

The quantity affected by $\delta$ in any one of the last five formulae 
has not only a maximum value, but absolutely the greatest value 
consistent with the constraints of the system. This may be shown 
in reference to (15) by giving to $\ddot{p}$, $\ddot{x}$, $\ddot{y}$, $\ddot{z}$, contained explicitly 
or implicitly in the expression affected by $\delta$, any possible finite 
increments $p'$, $x'$, $y'$, $z'$, and subtracting the original value of the 
expression from the value thus modified. Now, 

$$\sum [P(\ddot{p} + p')] - \sum\left[\tfrac{1}{2} m\{(\ddot{x} + x')^2 + (\ddot{y} + y')^2 + (\ddot{z} + z')^2\}\right] - \sum (P\ddot{p}) + \sum\left[\tfrac{1}{2} m(\ddot{x}^2 + \ddot{y}^2 + \ddot{z}^2)\right]$$
$$= \sum (Pp') - \sum [m(\ddot{x}x' + \ddot{y}y' + \ddot{z}z')] - \sum\left[\tfrac{1}{2} m(x'^2 + y'^2 + z'^2)\right].$$

But since $p'$, $x'$, $y'$, $z'$ are proportional to and of the same sign with 
possible values of $\delta\ddot{p}$, $\delta\ddot{x}$, $\delta\ddot{y}$, $\delta\ddot{z}$, we have, by the general formula of 
motion, 

$$\sum (Pp') - \sum [m(\ddot{x}x' + \ddot{y}y' + \ddot{z}z')] \leq 0.$$

The second member of the preceding equation is therefore negative. 
The first member is therefore negative, which proves the proposition 
with respect to (15). The demonstration is precisely the same with 
respect to (13) and (14), which may be regarded as particular 
cases of (15). 

To show the same with regard to (16) and (17), we have only to 
observe that the quantities affected by $\delta$ in these formulae differ from 
those affected by the same symbol in (14) and (15) only by the terms 

$$\sum (\dot{X}\dot{x} + \dot{Y}\dot{y} + \dot{Z}\dot{z}) \quad\text{and}\quad \sum (\dot{P}\dot{p}),$$

which will not be affected by any change in the accelerations of 
the system. 

When the forces are determined by the configuration (with or 
without the time), the principle may be enunciated as follows: The 
accelerations in the system are always such that the acceleration of 
the rate of work done by the forces diminished by one-half the sum 
of the products of the masses of the particles by the squares of their 
accelerations has the greatest possible value. 
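
The enunciation can be tried numerically in the simplest constrained case. The following is only an illustrative sketch and not part of the original paper: it assumes a single particle held on a fixed smooth plane, so that the admissible accelerations are those at right angles to the unit normal, and a force that does not depend on the acceleration. The function names are hypothetical. Since only the acceleration is varied, the quantity compared is the force times the acceleration diminished by one-half the mass times the square of the acceleration.

```python
import random

def dot(u, v):
    """Sum of products of rectangular components."""
    return sum(p * q for p, q in zip(u, v))

def varied_quantity(force, mass, acc):
    """X.a - (1/2) m a.a for one particle (the part affected by the variation)."""
    return dot(force, acc) - 0.5 * mass * dot(acc, acc)

def constrained_acceleration(force, mass, normal):
    """Acceleration on the plane normal.a = 0 (unit normal): tangential force over mass."""
    xn = dot(force, normal)
    return tuple((f - xn * n) / mass for f, n in zip(force, normal))

force = (1.0, -2.0, 3.0)      # illustrative values only
mass = 2.0
normal = (0.0, 0.0, 1.0)

a_star = constrained_acceleration(force, mass, normal)
best = varied_quantity(force, mass, a_star)

# Compare with many other accelerations that also lie in the plane.
rng = random.Random(0)
never_exceeded = True
for _ in range(1000):
    change = (rng.uniform(-1.0, 1.0), rng.uniform(-1.0, 1.0), 0.0)
    trial = tuple(a + d for a, d in zip(a_star, change))
    if varied_quantity(force, mass, trial) > best + 1e-12:
        never_exceeded = False
```

With these numbers `a_star` is the tangential part of the force divided by the mass, and no admissible trial acceleration gives a greater value, which is what the greatest-value property asserts for this special case.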

The formula (17), although in appearance less simple than (15), not 
only is more easily enunciated in words, but has the advantage that 
the quantity $\frac{d}{dt}\sum (P\dot{p})$ is entirely determined by the system with its 
forces and motions, which is not the case with $\sum (P\ddot{p})$. The value of 
the latter expression depends upon the manner in which we choose to 
represent the forces. For example, if a material point is revolving in 
a circle under the influence of a central force, we may write either 
$X\ddot{x} + Y\ddot{y} + Z\ddot{z}$ or $R\ddot{r}$ for $P\ddot{p}$, $R$ and $r$ denoting respectively the force 
and radius vector. Now $X\ddot{x} + Y\ddot{y} + Z\ddot{z}$ is manifestly unequal to $R\ddot{r}$. 
But $X\dot{x} + Y\dot{y} + Z\dot{z}$ is equal to $R\dot{r}$, and $\frac{d}{dt}(X\dot{x} + Y\dot{y} + Z\dot{z})$ is equal to 
$\frac{d}{dt}(R\dot{r})$. 

It may not be without interest to see what shape our general 
formulae will take in one of the most important cases of forces 
dependent upon the velocities. If a body which can be treated as a 
point is moving in a medium which presents a resistance expressed by 
any function of the velocity, the terms due to that resistance in the 
general formula of motion may be expressed in the form 

$$-\varphi(v)\left(\frac{\dot{x}}{v}\,\delta\ddot{x} + \frac{\dot{y}}{v}\,\delta\ddot{y} + \frac{\dot{z}}{v}\,\delta\ddot{z}\right),$$

where $v$ denotes the velocity and $\varphi(v)$ the resistance. But 

$$\frac{\dot{x}}{v}\,\delta\ddot{x} + \frac{\dot{y}}{v}\,\delta\ddot{y} + \frac{\dot{z}}{v}\,\delta\ddot{z} = \delta\frac{dv}{dt}.$$

The terms due to the resistance reduce, therefore, to 

$$-\delta\left[\varphi(v)\,\dot{v}\right],$$

or, 

$$-\delta\,\frac{d}{dt}f(v),$$

where $f$ denotes the primitive of the function denoted by $\varphi$. 

Discontinuous Changes of Velocity. Formula (9), which relates to 
discontinuous changes of velocity, is capable of similar transformations. 
If we set 

$$w^2 = (\Delta\dot{x})^2 + (\Delta\dot{y})^2 + (\Delta\dot{z})^2,$$

the formula reduces to 

$$\delta\sum\left(X\,\Delta\dot{x} + Y\,\Delta\dot{y} + Z\,\Delta\dot{z} - \tfrac{1}{2} m w^2\right) \leq 0, \tag{18}$$

where $X$, $Y$, $Z$ are to be regarded as constant. If $\sum (P\,dp)$ represents 
the sum of the moments of the impulsive forces, and we regard $P$ as 
constant, we have 

$$\delta\left[\sum (P\,\Delta\dot{p}) - \sum\left(\tfrac{1}{2} m w^2\right)\right] \leq 0. \tag{19}$$

The expressions affected by $\delta$ in these formulae have a greater value 
than they would receive from any other changes of velocity consistent 
with the constraints of the system. 


Deduction of other Properties of Motion. 

The principles which have been established furnish a convenient 
point of departure for the demonstration of various properties of 
motion relating to maxima and minima. We may obtain several 
such properties by considering how the accelerations of a system, at a 
given instant, will be modified by changes of the forces or of the 
constraints to which the system is subject. Let us suppose that the 
forces $X$, $Y$, $Z$ of a system receive the increments $X'$, $Y'$, $Z'$, in 
consequence of which, and of certain additional constraints, which do not 
produce any discontinuity in the velocities, the components of 
acceleration $\ddot{x}$, $\ddot{y}$, $\ddot{z}$ receive the increments $x'$, $y'$, $z'$. The expression 

$$\sum\left[(X + X')(\ddot{x} + x') + (Y + Y')(\ddot{y} + y') + (Z + Z')(\ddot{z} + z') - \tfrac{1}{2} m\{(\ddot{x} + x')^2 + (\ddot{y} + y')^2 + (\ddot{z} + z')^2\}\right] \tag{20}$$

will be the greatest possible for any values of $x'$, $y'$, $z'$ consistent with 
the constraints. But this expression may be divided into three parts, 

$$\sum\left[(X + X')\ddot{x} + (Y + Y')\ddot{y} + (Z + Z')\ddot{z} - \tfrac{1}{2} m(\ddot{x}^2 + \ddot{y}^2 + \ddot{z}^2)\right], \tag{21}$$

$$\sum\left[Xx' + Yy' + Zz' - m(\ddot{x}x' + \ddot{y}y' + \ddot{z}z')\right], \tag{22}$$

and 

$$\sum\left[X'x' + Y'y' + Z'z' - \tfrac{1}{2} m(x'^2 + y'^2 + z'^2)\right]. \tag{23}$$

The first part is evidently constant with reference to variations of 
$x'$, $y'$, $z'$, and may, therefore, be neglected. With respect to the second 
part we observe that by the general formula of the motion we have 

$$\sum\left[X\,\delta\ddot{x} + Y\,\delta\ddot{y} + Z\,\delta\ddot{z} - m(\ddot{x}\,\delta\ddot{x} + \ddot{y}\,\delta\ddot{y} + \ddot{z}\,\delta\ddot{z})\right] = 0$$

for all values of $\delta\ddot{x}$, $\delta\ddot{y}$, $\delta\ddot{z}$ which are possible and reversible before the 
addition of the new constraints. But values proportional to $x'$, $y'$, $z'$, 
and of the same sign, are evidently consistent with the original 
constraints, and when the components of acceleration are altered to 
$\ddot{x} + x'$, $\ddot{y} + y'$, $\ddot{z} + z'$, variations of these quantities proportional to and 
of the same sign as $-x'$, $-y'$, $-z'$ are evidently consistent with the 
original constraints. Now if these latter variations were not possible 
before the accelerations were modified by the addition of the new forces 
and constraints, it must be that some constraint was then operative 
which afterwards ceased to be so. The expression (22) will, therefore, 
be equal to zero, provided only that all the constraints which were 
operative before the addition of the new forces and constraints, remain 
operative afterwards.* With this limitation, therefore, the expression 
(23) must have the greatest value consistent with the constraints. 
This principle may be expressed without reference to rectangular 
coordinates. If we write $u'$ for the relative acceleration due to the 
additional forces and constraints, we have 

$$u'^2 = x'^2 + y'^2 + z'^2,$$

and expression (23) reduces to 

$$\sum\left(X'x' + Y'y' + Z'z' - \tfrac{1}{2} m u'^2\right). \tag{24}$$


If the sum of the moments of the additional forces which are 
considered is represented by $\sum (Q\,dq)$ (the $q$ representing quantities 
determined by the configuration of the system), we have 

$$\sum (X'\dot{x} + Y'\dot{y} + Z'\dot{z}) = \sum (Q\dot{q}).$$

We may distinguish the values of $\frac{d^2 q}{dt^2}$ immediately before and 
immediately after the application of the additional forces and constraints 
by the expressions $\ddot{q}$ and $\ddot{q} + q'$. With this understanding, we have, 
by differentiation of the preceding equation, 

$$\sum\left[\dot{X}'\dot{x} + \dot{Y}'\dot{y} + \dot{Z}'\dot{z} + X'(\ddot{x} + x') + Y'(\ddot{y} + y') + Z'(\ddot{z} + z')\right] = \sum\left[\dot{Q}\dot{q} + Q(\ddot{q} + q')\right],$$

whence it appears that $\sum (X'x' + Y'y' + Z'z')$ differs from $\sum (Qq')$ only 
by quantities which are independent of the relative acceleration due 
to the additional forces and constraints. It follows that these relative 
accelerations are such as to make 

$$\sum (Qq') - \sum\left(\tfrac{1}{2} m u'^2\right) \tag{25}$$

a maximum. 

* As an illustration of the significance of this limitation, we may consider the 
condition afforded by the impenetrability of two bodies in contact. Let us suppose that if 
subject only to the original forces and constraints they would continue in contact, but 
that, under the influence of the additional forces and constraints, the contact will cease. 
The impenetrability of the bodies then ceases to be operative as a constraint. Such 
cases form an exception to the principle which is to be established. But there are no 
exceptions when all the original constraints are expressed by equations. 

It will be observed that the condition which determines these 
relative accelerations is of precisely the same form as that which 
determines absolute accelerations. 

An important case is that in which new constraints are added but 
no new forces. The relative accelerations are determined in this case 
by the condition that $\sum\left(\tfrac{1}{2} m u'^2\right)$ is a minimum. In any case of motion, 
in which finite forces do not act at points, lines or surfaces, we may 
first calculate the accelerations which would be produced if there were 
no constraints, and then determine the relative accelerations due to 
the constraints by the condition that $\sum\left(\tfrac{1}{2} m u'^2\right)$ is a minimum. This 
is Gauss's principle of least constraint.* 
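
The procedure just described (free accelerations first, then the relative accelerations fixed by a minimum condition) can be sketched for a very small system. The example below is an illustration under stated assumptions, not taken from the paper: two particles move on one straight line and are joined so that they must share a common acceleration; the masses, forces and function name are hypothetical.

```python
def constraint_measure(masses, forces, accs):
    """Sum of (1/2) m u'^2, where u' is the departure from the free acceleration X/m."""
    return sum(0.5 * m * (a - x / m) ** 2 for m, x, a in zip(masses, forces, accs))

masses = (1.0, 3.0)    # illustrative values only
forces = (4.0, 0.0)

# Step 1: accelerations that would be produced if there were no constraint.
free = tuple(x / m for m, x in zip(masses, forces))

# Step 2: the common acceleration that makes the measure least.
# Setting the derivative with respect to the common value to zero gives
# total force over total mass.
a_common = sum(forces) / sum(masses)
z_min = constraint_measure(masses, forces, (a_common, a_common))

# Cross-check by scanning other common accelerations on a grid.
grid = [n / 100.0 for n in range(-300, 301)]
z_scan = min(constraint_measure(masses, forces, (a, a)) for a in grid)
```

The scan finds no common acceleration with a smaller value than the one obtained from the minimum condition, and that value is the familiar total force divided by total mass.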

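Again, in any case of motion, we may suppose $u$ to denote the 
acceleration which would be produced by the constraints alone, and $u'$ 
the relative acceleration produced by the forces; we then have 

$$\sum [m(\ddot{x}x' + \ddot{y}y' + \ddot{z}z')] = 0,$$

whence, if we write $u''$ for the resultant or actual acceleration, 

$$\sum\left(\tfrac{1}{2} m u^2\right) + \sum\left(\tfrac{1}{2} m u'^2\right) = \sum\left(\tfrac{1}{2} m u''^2\right).$$

Moreover, differentiating (25), we obtain 

$$\sum (Q\,\delta q') - \sum [m(x'\,\delta x' + y'\,\delta y' + z'\,\delta z')] = 0,$$

whence, since $\delta x'$, $\delta y'$, $\delta z'$, $\delta q'$ may have values proportional to $x'$, $y'$, $z'$, $q'$, 

$$\sum (Qq') = 2\sum\left(\tfrac{1}{2} m u'^2\right).$$

These relations are similar to those which exist with respect to 
vis viva and impulsive forces. 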

* This principle may be derived very directly from the general formula (6), or vice 
versa, for $\sum\left(\tfrac{1}{2} m u'^2\right)$ may be put in the form 

$$\sum \tfrac{1}{2} m\left[\left(\ddot{x} - \frac{X}{m}\right)^2 + \left(\ddot{y} - \frac{Y}{m}\right)^2 + \left(\ddot{z} - \frac{Z}{m}\right)^2\right],$$

the variation of which, with the sign changed, is identical with the first member of (6). 

Particular Equations of Motion. 

From the general formula (12), we may easily obtain particular 
equations which will express the laws of motion in a very general 
form. 

Let $d\omega_1$, $d\omega_2$, etc. be infinitesimals (not necessarily complete 
differentials) the values of which are independent, and by means 
of which we can perfectly define any infinitesimal change in the 
configuration of the system; and let 

$$\dot{\omega}_1 = \frac{d\omega_1}{dt}, \quad \dot{\omega}_2 = \frac{d\omega_2}{dt}, \quad \text{etc.},$$

where $d\omega_1$, $d\omega_2$, etc. are determined by the change in the 
configuration in the interval of time $dt$; and let 

$$\ddot{\omega}_1 = \frac{d\dot{\omega}_1}{dt}, \quad \ddot{\omega}_2 = \frac{d\dot{\omega}_2}{dt}, \quad \text{etc.}$$

Also let 

$$U = \sum\left(\tfrac{1}{2} m u^2\right).$$

It is evident that $U$ can be expressed in terms of $\dot{\omega}_1$, $\dot{\omega}_2$, etc., 
$\ddot{\omega}_1$, $\ddot{\omega}_2$, etc., and the quantities which express the configuration of the 
system, and that (since $\delta$ is used to denote a variation which does 
not affect the configuration or the velocities), 

$$\delta U = \frac{dU}{d\ddot{\omega}_1}\,\delta\ddot{\omega}_1 + \frac{dU}{d\ddot{\omega}_2}\,\delta\ddot{\omega}_2 + \text{etc.}$$

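Moreover, since the quantities $p$ in the general formula are entirely 
determined by the configuration of the system, 

$$\dot{p} = \frac{dp}{d\omega_1}\,\dot{\omega}_1 + \frac{dp}{d\omega_2}\,\dot{\omega}_2 + \text{etc.},$$

where $\frac{dp}{d\omega_1}$ denotes the ratio of simultaneous values of $dp$ and $d\omega_1$, 
when $d\omega_2$, etc. are equal to zero, and $\frac{dp}{d\omega_2}$, etc. are to be interpreted 
on the same principle. Multiplying by $P$, and taking the sum with 
respect to the several forces, we have 

$$\sum (P\dot{p}) = \Omega_1\dot{\omega}_1 + \Omega_2\dot{\omega}_2 + \text{etc.},$$

where 

$$\Omega_1 = \sum\left(P\frac{dp}{d\omega_1}\right), \quad \Omega_2 = \sum\left(P\frac{dp}{d\omega_2}\right), \quad \text{etc.}$$

If we differentiate with respect to $t$, and take the variation denoted 
by $\delta$, we obtain 

$$\sum (P\,\delta\ddot{p}) = \Omega_1\,\delta\ddot{\omega}_1 + \Omega_2\,\delta\ddot{\omega}_2 + \text{etc.}$$

The general formula (12) is thus reduced to the form 

$$\left(\Omega_1 - \frac{dU}{d\ddot{\omega}_1}\right)\delta\ddot{\omega}_1 + \left(\Omega_2 - \frac{dU}{d\ddot{\omega}_2}\right)\delta\ddot{\omega}_2 + \text{etc.} \leq 0. \tag{26}$$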

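If the forces have a potential $V$ (so that $\Omega_1 = -\frac{dV}{d\omega_1}$, etc.), we may write 

$$\left(\frac{dV}{d\omega_1} + \frac{dU}{d\ddot{\omega}_1}\right)\delta\ddot{\omega}_1 + \left(\frac{dV}{d\omega_2} + \frac{dU}{d\ddot{\omega}_2}\right)\delta\ddot{\omega}_2 + \text{etc.} \geq 0, \tag{27}$$

where $\frac{dV}{d\omega_1}$ denotes the ratio of $dV$ and $d\omega_1$ when $d\omega_2$, etc. have the 
value zero, and the analogous expressions are to be interpreted on 
the same principle. 

If the variations $\delta\ddot{\omega}_1$, $\delta\ddot{\omega}_2$, etc. are capable both of positive and of 
negative values, we must have 

$$\frac{dU}{d\ddot{\omega}_1} = \Omega_1, \quad \frac{dU}{d\ddot{\omega}_2} = \Omega_2, \quad \text{etc.}, \tag{28}$$

or, 

$$\frac{dV}{d\omega_1} + \frac{dU}{d\ddot{\omega}_1} = 0, \quad \frac{dV}{d\omega_2} + \frac{dU}{d\ddot{\omega}_2} = 0, \quad \text{etc.} \tag{29}$$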

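To illustrate the use of these equations in a case in which $d\omega_1$, $d\omega_2$, 
etc. are not exact differentials, we may apply them to the problem of 
the rotation of a rigid body of which one point is fixed. If $d\omega_1$, $d\omega_2$, 
$d\omega_3$ denote infinitesimal rotations about the principal axes which pass 
through the fixed point, $\Omega_1$, $\Omega_2$, $\Omega_3$ denote the moments of the 
impressed forces about these axes, and the value of $U$ will be given 
by the formula 

$$2U = (a + b + c)(\dot{\omega}_1^2 + \dot{\omega}_2^2 + \dot{\omega}_3^2)^2 - (\dot{\omega}_1^2 + \dot{\omega}_2^2 + \dot{\omega}_3^2)(a\dot{\omega}_1^2 + b\dot{\omega}_2^2 + c\dot{\omega}_3^2)$$
$$+ 2(b - c)\dot{\omega}_2\dot{\omega}_3\ddot{\omega}_1 + 2(c - a)\dot{\omega}_3\dot{\omega}_1\ddot{\omega}_2 + 2(a - b)\dot{\omega}_1\dot{\omega}_2\ddot{\omega}_3$$
$$+ (b + c)\ddot{\omega}_1^2 + (c + a)\ddot{\omega}_2^2 + (a + b)\ddot{\omega}_3^2,$$

where $a$, $b$, and $c$ are constants, $b + c$, $c + a$, $a + b$ being the moments 
of inertia about the three axes. Hence, 

$$\frac{dU}{d\ddot{\omega}_1} = (b - c)\dot{\omega}_2\dot{\omega}_3 + (b + c)\ddot{\omega}_1,$$

$$\frac{dU}{d\ddot{\omega}_2} = (c - a)\dot{\omega}_3\dot{\omega}_1 + (c + a)\ddot{\omega}_2,$$

$$\frac{dU}{d\ddot{\omega}_3} = (a - b)\dot{\omega}_1\dot{\omega}_2 + (a + b)\ddot{\omega}_3,$$

and the equations of motion are 

$$\ddot{\omega}_1 = \frac{(c - b)\dot{\omega}_2\dot{\omega}_3 + \Omega_1}{c + b},$$

$$\ddot{\omega}_2 = \frac{(a - c)\dot{\omega}_3\dot{\omega}_1 + \Omega_2}{a + c},$$

$$\ddot{\omega}_3 = \frac{(b - a)\dot{\omega}_1\dot{\omega}_2 + \Omega_3}{b + a}.$$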
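
The differential coefficients of U with respect to the angular accelerations can be checked against a direct sum over particles. The sketch below is illustrative and rests on assumptions not found in the paper: the body is replaced by six unit masses placed in pairs on the principal axes (so that the constants are a = 2, b = 8, c = 18), the acceleration of each particle is taken as the usual tangential plus centripetal part, and all names are hypothetical.

```python
def cross(u, v):
    """Skew product from rectangular components."""
    return (u[1] * v[2] - u[2] * v[1],
            u[2] * v[0] - u[0] * v[2],
            u[0] * v[1] - u[1] * v[0])

def dot(u, v):
    return sum(p * q for p, q in zip(u, v))

# Hypothetical rigid body: (mass, position) pairs on the principal axes.
POINTS = [(1.0, (1.0, 0.0, 0.0)), (1.0, (-1.0, 0.0, 0.0)),
          (1.0, (0.0, 2.0, 0.0)), (1.0, (0.0, -2.0, 0.0)),
          (1.0, (0.0, 0.0, 3.0)), (1.0, (0.0, 0.0, -3.0))]
a, b, c = 2.0, 8.0, 18.0   # sums of m x^2, m y^2, m z^2 for POINTS

def half_sum_m_u2(w, alpha):
    """U = sum of (1/2) m u^2 for angular velocity w and angular acceleration alpha."""
    total = 0.0
    for m, r in POINTS:
        tangential = cross(alpha, r)
        centripetal = cross(w, cross(w, r))
        acc = tuple(p + q for p, q in zip(tangential, centripetal))
        total += 0.5 * m * dot(acc, acc)
    return total

def gradient_numeric(w, alpha, h=1e-3):
    """Central differences of U with respect to the angular accelerations."""
    grad = []
    for n in range(3):
        up = list(alpha)
        down = list(alpha)
        up[n] += h
        down[n] -= h
        grad.append((half_sum_m_u2(w, up) - half_sum_m_u2(w, down)) / (2 * h))
    return tuple(grad)

def gradient_formula(w, alpha):
    """The three expressions for dU/d(angular acceleration) in terms of a, b, c."""
    return ((b - c) * w[1] * w[2] + (b + c) * alpha[0],
            (c - a) * w[2] * w[0] + (c + a) * alpha[1],
            (a - b) * w[0] * w[1] + (a + b) * alpha[2])

w = (0.3, -0.5, 0.7)        # illustrative angular velocity
alpha = (1.0, 2.0, -1.5)    # illustrative angular acceleration
max_gap = max(abs(p - q) for p, q in
              zip(gradient_numeric(w, alpha), gradient_formula(w, alpha)))
```

Because U is of the second degree in the angular accelerations, the central differences agree with the closed expressions up to rounding, so `max_gap` comes out very small.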



II. 


ON THE FUNDAMENTAL FORMULA OF STATISTICAL 
MECHANICS, WITH APPLICATIONS TO ASTRONOMY 
AND THERMODYNAMICS. 

[Proceedings of the American Association for the Advancement of Science, 
vol. xxxiii. pp. 57, 58, 1884.] 

(ABSTRACT.) 

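Suppose that we have a great number of systems which consist 
of material points and are identical in character, but different in 
configuration and velocities, and in which the forces are determined 
by the configuration alone. Let the number of systems in which the 
coordinates and velocities lie severally between the following limits, 
viz., between 

$$x_1 \text{ and } x_1 + dx_1, \quad y_1 \text{ and } y_1 + dy_1, \quad z_1 \text{ and } z_1 + dz_1, \quad x_2 \text{ and } x_2 + dx_2, \quad \text{etc.},$$

$$\dot{x}_1 \text{ and } \dot{x}_1 + d\dot{x}_1, \quad \dot{y}_1 \text{ and } \dot{y}_1 + d\dot{y}_1, \quad \dot{z}_1 \text{ and } \dot{z}_1 + d\dot{z}_1, \quad \dot{x}_2 \text{ and } \dot{x}_2 + d\dot{x}_2, \quad \text{etc.},$$

be denoted by 

$$L\,dx_1\,dy_1\,dz_1\,dx_2\ \text{etc.}\ d\dot{x}_1\,d\dot{y}_1\,d\dot{z}_1\,d\dot{x}_2\ \text{etc.}$$

The manner in which the quantity $L$ varies with the time is given 
by the equation 

$$\frac{dL}{dt} = -\sum\left(\frac{dL}{dx_1}\,\dot{x}_1 + \frac{dL}{d\dot{x}_1}\,\ddot{x}_1\right),$$

where $t$, $x_1$, $y_1$, $z_1$, $x_2$, etc., $\dot{x}_1$, $\dot{y}_1$, $\dot{z}_1$, $\dot{x}_2$, etc., are the independent 
variables, and the summation relates to all the coordinates. 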
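
A numerical illustration may make the statement concrete. The sketch below is not from the abstract and rests on two assumptions: that the equation has the usual form in which the rate of change of L at a fixed point equals minus the sum of (dL/dx) times the velocity and (dL/dv) times the acceleration, and that the systems are single points on a line with acceleration equal to minus the coordinate. The initial distribution and all names are hypothetical.

```python
import math

def initial_density(x0, v0):
    """Hypothetical distribution of the systems at time 0."""
    return math.exp(-(x0 - 1.0) ** 2 - 2.0 * v0 ** 2)

def density(x, v, t):
    """Density at (x, v, t): trace the point back along the motion x'' = -x."""
    x0 = x * math.cos(t) - v * math.sin(t)
    v0 = x * math.sin(t) + v * math.cos(t)
    return initial_density(x0, v0)

def residual(x, v, t, h=1e-5):
    """dL/dt + (dL/dx) v + (dL/dv) a, by central differences; should vanish."""
    dL_dt = (density(x, v, t + h) - density(x, v, t - h)) / (2 * h)
    dL_dx = (density(x + h, v, t) - density(x - h, v, t)) / (2 * h)
    dL_dv = (density(x, v + h, t) - density(x, v - h, t)) / (2 * h)
    acceleration = -x   # force determined by the configuration alone
    return dL_dt + dL_dx * v + dL_dv * acceleration

worst = max(abs(residual(x, v, t))
            for x in (-1.0, 0.3, 1.2)
            for v in (-0.8, 0.0, 0.5)
            for t in (0.0, 0.7, 2.0))
```

The residual stays at the level of the finite-difference error at every sampled point, as the equation requires when the density is simply carried along by the motion.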

The object of the paper is to establish this proposition (which is not 
claimed as new, but which has hardly received the recognition which 
it deserves) and to show its applications to astronomy and 
thermodynamics. 

ELEMENTS OF VECTOR ANALYSIS. 

[Privately printed, New Haven, pp. 17-50, 1881 ; pp. 50-90, 1884.] 

(The fundamental principles of the following analysis are such as are familiar under 
a slightly different form to students of quaternions. The manner in which the subject 
is developed is somewhat different from that followed in treatises on quaternions, since 
the object of the writer does not require any use of the conception of the quaternion, 
being simply to give a suitable notation for those relations between vectors, or between 
vectors and scalars, which seem most important, and which lend themselves most readily 
to analytical transformations, and to explain some of these transformations. As a 
precedent for such a departure from quaternionic usage, Clifford's Kinematic may be 
cited. In this connection, the name of Grassmann may also be mentioned, to whose 
system the following method attaches itself in some respects more closely than to that 
of Hamilton.) 


CHAPTER I. 

CONCERNING THE ALGEBRA OF VECTORS. 

Fundamental Notions. 

1. Definition. If anything has magnitude and direction, its 
magnitude and direction taken together constitute what is called a vector. 
The numerical description of a vector requires three numbers, but 
nothing prevents us from using a single letter for its symbolical 
designation. An algebra or analytical method in which a single letter 
or other expression is used to specify a vector may be called a vector 
algebra or vector analysis. 

Def. As distinguished from vectors the real (positive or negative) 
quantities of ordinary algebra are called scalars.* 

As it is convenient that the form of the letter should indicate 
whether a vector or a scalar is denoted, we shall use the small Greek 
letters to denote vectors, and the small English letters to denote 
scalars. (The three letters, i, j, k, will make an exception, to be 
mentioned more particularly hereafter. Moreover, $\pi$ will be used in 
its usual scalar sense, to denote the ratio of the circumference of 
a circle to its diameter.) 

* The imaginaries of ordinary algebra may be called biscalars, and that which 
corresponds to them in the theory of vectors, bivectors. But we shall have no occasion to 
consider either of these. [See, however, footnote on p. 84.] 

2. Def. Vectors are said to be equal when they are the same both 
in direction and in magnitude. This equality is denoted by the 
ordinary sign, as $\alpha = \beta$. The reader will observe that this vector 
equation is the equivalent of three scalar equations. 

A vector is said to be equal to zero, when its magnitude is zero. 
Such vectors may be set equal to one another, irrespectively of any 
considerations relating to direction. 

3. Perhaps the most simple example of a vector is afforded by a 
directed straight line, as the line drawn from A to B. We may use 
the notation AB to denote this line as a vector, i.e., to denote its 
length and direction without regard to its position in other respects. 
The points A and B may be distinguished as the origin and the 
terminus of the vector. Since any magnitude may be represented by 
a length, any vector may be represented by a directed line; and it 
will often be convenient to use language relating to vectors, which 
refers to them as thus represented. 

Reversal of Direction, Scalar Multiplication and Division. 

4. The negative sign ($-$) reverses the direction of a vector. (Sometimes 
the sign $+$ may be used to call attention to the fact that the 
vector has not the negative sign.) 

Def. A vector is said to be multiplied or divided by a scalar 
when its magnitude is multiplied or divided by the numerical value 
of the scalar and its direction is either unchanged or reversed 
according as the scalar is positive or negative. These operations 
are represented by the same methods as multiplication and division 
in algebra, and are to be regarded as substantially identical with 
them. The terms scalar multiplication and scalar division are used 
to denote multiplication and division by scalars, whether the quantity 
multiplied or divided is a scalar or a vector. 

5. Def. A unit vector is a vector of which the magnitude is unity. 

Any vector may be regarded as the product of a positive scalar 
(the magnitude of the vector) and a unit vector. 

The notation $\alpha_0$ may be used to denote the magnitude of the 
vector $\alpha$. 

Addition and Subtraction of Vectors. 

6. Def. The sum of the vectors $\alpha$, $\beta$, etc. (written $\alpha + \beta +$ etc.) is 
the vector found by the following process. Assuming any point A, 
we determine successively the points B, C, etc., so that AB $= \alpha$, 
BC $= \beta$, etc. The vector drawn from A to the last point thus 
determined is the sum required. This is sometimes called the geometrical 
sum, to distinguish it from an algebraic sum or an arithmetical sum. 
It is also called the resultant, and $\alpha$, $\beta$, etc. are called the components. 
When the vectors to be added are all parallel to the same straight line, 
geometrical addition reduces to algebraic; when they have all the same 
direction, geometrical addition like algebraic reduces to arithmetical. 

It may easily be shown that the value of a sum is not affected 
by changing the order of two consecutive terms, and therefore that 
it is not affected by any change in the order of the terms. Again, it 
is evident from the definition that the value of a sum is not altered 
by uniting any of its terms in brackets, as $\alpha + [\beta + \gamma] +$ etc., which is 
in effect to substitute the sum of the terms enclosed for the terms 
themselves among the vectors to be added. In other words, the 
commutative and associative principles of arithmetical and algebraic 
addition hold true of geometrical addition. 

7. Def. A vector is said to be subtracted when it is added after 
reversal of direction. This is indicated by the use of the sign $-$ 
instead of $+$. 

8. It is easily shown that the distributive principle of arithmetical 
and algebraic multiplication applies to the multiplication of sums of 
vectors by scalars or sums of scalars, i.e., 

$$(m + n + \text{etc.})[\alpha + \beta + \text{etc.}] = m\alpha + n\alpha + \text{etc.} + m\beta + n\beta + \text{etc.} + \text{etc.}$$

9. Vector Equations. If we have equations between sums and 
differences of vectors, we may transpose terms in them, multiply or 
divide by any scalar, and add or subtract the equations, precisely as 
in the case of the equations of ordinary algebra. Hence, if we have 
several such equations containing known and unknown vectors, the 
processes of elimination and reduction by which the unknown vectors 
may be expressed in terms of the known are precisely the same, and 
subject to the same limitations, as if the letters representing vectors 
represented scalars. This will be evident if we consider that in the 
multiplications incident to elimination in the supposed scalar equations 
the multipliers are the coefficients of the unknown quantities, or 
functions of these coefficients, and that such multiplications may be 
applied to the vector equations, since the coefficients are scalars. 

10. Linear relation of four vectors, Coordinates. If $\alpha$, $\beta$, and $\gamma$ 
are any given vectors not parallel to the same plane, any other vector 
$\rho$ may be expressed in the form 

$$\rho = a\alpha + b\beta + c\gamma.$$

If $\alpha$, $\beta$, and $\gamma$ are unit vectors, $a$, $b$, and $c$ are the ordinary scalar 
components of $\rho$ parallel to $\alpha$, $\beta$, and $\gamma$. If $\rho =$ OP, ($\alpha$, $\beta$, $\gamma$ being 
unit vectors), $a$, $b$, and $c$ are the cartesian coordinates of the point P 
referred to axes through O parallel to $\alpha$, $\beta$, and $\gamma$. When the values 
of these scalars are given, $\rho$ is said to be given in terms of $\alpha$, $\beta$, and $\gamma$. 
It is generally in this way that the value of a vector is specified, viz., 
in terms of three known vectors. For such purposes of reference, a 
system of three mutually perpendicular vectors has certain evident 
advantages. 
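
As an illustration of this linear relation, the scalars a, b, c can be found numerically when the rectangular components of the four vectors are known. The sketch below is an assumption-laden example rather than the author's method: it solves for the coefficients by ratios of triple products (which the text introduces only in later sections), uses exact fractions, and its function names and numbers are hypothetical.

```python
from fractions import Fraction

def dot(u, v):
    return sum(p * q for p, q in zip(u, v))

def cross(u, v):
    return (u[1] * v[2] - u[2] * v[1],
            u[2] * v[0] - u[0] * v[2],
            u[0] * v[1] - u[1] * v[0])

def coefficients(rho, alpha, beta, gamma):
    """Scalars (a, b, c) with rho = a*alpha + b*beta + c*gamma (integer components)."""
    volume = dot(alpha, cross(beta, gamma))
    if volume == 0:
        raise ValueError("alpha, beta, gamma are parallel to the same plane")
    a = Fraction(dot(rho, cross(beta, gamma)), volume)
    b = Fraction(dot(rho, cross(gamma, alpha)), volume)
    c = Fraction(dot(rho, cross(alpha, beta)), volume)
    return a, b, c

alpha, beta, gamma = (1, 1, 0), (0, 1, 1), (1, 0, 1)   # not parallel to one plane
rho = (2, 3, 5)
a, b, c = coefficients(rho, alpha, beta, gamma)

# Rebuild rho from the three coefficients as a check.
rebuilt = tuple(a * p + b * q + c * r for p, q, r in zip(alpha, beta, gamma))
```

Here `rebuilt` reproduces `rho`, and the function refuses three vectors parallel to the same plane, which is exactly the case excluded in the statement above.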

11. Normal systems of unit vectors. The letters i, j, k are 
appropriated to the designation of a normal system of unit vectors, i.e., 
three unit vectors, each of which is at right angles to the other two 
and determined in direction by them in a perfectly definite manner. 
We shall always suppose that k is on the side of the i-j plane on 
which a rotation from i to j (through one right angle) appears 
counter-clockwise. In other words, the directions of i, j, and k are 
to be so determined that if they be turned (remaining rigidly 
connected with each other) so that i points to the east, and j to the north, 
k will point upward. When rectangular axes of X, Y, and Z are 
employed, their directions will be conformed to a similar condition, 
and i, j, k (when the contrary is not stated) will be supposed parallel 
to these axes respectively. We may have occasion to use more than 
one such system of unit vectors, just as we may use more than one 
system of coordinate axes. In such cases, the different systems may 
be distinguished by accents or otherwise. 

12. Numerical computation of a geometrical sum. If 

$$\rho = a\alpha + b\beta + c\gamma,$$

$$\sigma = a'\alpha + b'\beta + c'\gamma,$$

etc., 

then 

$$\rho + \sigma + \text{etc.} = (a + a' + \text{etc.})\alpha + (b + b' + \text{etc.})\beta + (c + c' + \text{etc.})\gamma,$$

i.e., the coefficients by which a geometrical sum is expressed in terms 
of three vectors are the sums of the coefficients by which the separate 
terms of the geometrical sum are expressed in terms of the same 
three vectors. 


Direct and Skew Products of Vectors. 

13. Def. The direct product of $\alpha$ and $\beta$ (written $\alpha.\beta$) is the scalar 
quantity obtained by multiplying the product of their magnitudes 
by the cosine of the angle made by their directions. 

14. Def. The skew product of $\alpha$ and $\beta$ (written $\alpha\times\beta$) is a vector 
function of $\alpha$ and $\beta$. Its magnitude is obtained by multiplying the 
product of the magnitudes of $\alpha$ and $\beta$ by the sine of the angle made 
by their directions. Its direction is at right angles to $\alpha$ and $\beta$, and 
on that side of the plane containing $\alpha$ and $\beta$ (supposed drawn from 
a common origin) on which a rotation from $\alpha$ to $\beta$ through an arc 
of less than 180° appears counter-clockwise. 

The direction of $\alpha\times\beta$ may also be defined as that in which an 
ordinary screw advances as it turns so as to carry $\alpha$ toward $\beta$. 
Again, if $\alpha$ be directed toward the east, and $\beta$ lie in the same 
horizontal plane and on the north side of $\alpha$, $\alpha\times\beta$ will be directed 
upward. 

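15. It is evident from the preceding definitions that 

$$\alpha.\beta = \beta.\alpha, \quad\text{and}\quad \alpha\times\beta = -\beta\times\alpha.$$

16. Moreover, 

$$[n\alpha].\beta = \alpha.[n\beta] = n[\alpha.\beta],$$

and 

$$[n\alpha]\times\beta = \alpha\times[n\beta] = n[\alpha\times\beta].$$

The brackets may therefore be omitted in such expressions. 

17. From the definitions of No. 11 it appears that 

$$i.i = j.j = k.k = 1,$$
$$i.j = j.i = i.k = k.i = j.k = k.j = 0,$$
$$i\times i = 0, \quad j\times j = 0, \quad k\times k = 0,$$
$$i\times j = k, \quad j\times k = i, \quad k\times i = j,$$
$$j\times i = -k, \quad k\times j = -i, \quad i\times k = -j.$$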

18. If we resolve $\beta$ into two components $\beta'$ and $\beta''$, of which the 
first is parallel and the second perpendicular to $\alpha$, we shall have 

$$\alpha.\beta = \alpha.\beta' \quad\text{and}\quad \alpha\times\beta = \alpha\times\beta''.$$

19. $\alpha.[\beta + \gamma] = \alpha.\beta + \alpha.\gamma$ and $\alpha\times[\beta + \gamma] = \alpha\times\beta + \alpha\times\gamma$. 

To prove this, let $\sigma = \beta + \gamma$, and resolve each of the vectors $\beta$, $\gamma$, $\sigma$ 
into two components, one parallel and the other perpendicular to $\alpha$. 
Let these be $\beta'$, $\beta''$, $\gamma'$, $\gamma''$, $\sigma'$, $\sigma''$. Then the equations to be proved 
will reduce by the last section to 

$$\alpha.\sigma' = \alpha.\beta' + \alpha.\gamma' \quad\text{and}\quad \alpha\times\sigma'' = \alpha\times\beta'' + \alpha\times\gamma''.$$

Now since $\sigma = \beta + \gamma$ we may form a triangle in space, the sides of 
which shall be $\beta$, $\gamma$, and $\sigma$. Projecting this on a plane perpendicular 
to $\alpha$, we obtain a triangle having the sides $\beta''$, $\gamma''$, and $\sigma''$, which 
affords the relation $\sigma'' = \beta'' + \gamma''$. If we pass planes perpendicular 
to $\alpha$ through the vertices of the first triangle, they will give on a 
line parallel to $\alpha$ segments equal to $\beta'$, $\gamma'$, $\sigma'$. Thus we obtain the 
relation $\sigma' = \beta' + \gamma'$. Therefore $\alpha.\sigma' = \alpha.\beta' + \alpha.\gamma'$, since all the cosines 
involved in these products are equal to unity. Moreover, if $\alpha$ is a 
unit vector, we shall evidently have $\alpha\times\sigma'' = \alpha\times\beta'' + \alpha\times\gamma''$, since the 
effect of the skew multiplication by $\alpha$ upon vectors in a plane 
perpendicular to $\alpha$ is simply to rotate them all 90° in that plane. 
But any case may be reduced to this by dividing both sides of the 
equation to be proved by the magnitude of $\alpha$. The propositions 
are therefore proved. 

20. Hence, 

$$[\alpha + \beta].\gamma = \alpha.\gamma + \beta.\gamma, \qquad [\alpha + \beta]\times\gamma = \alpha\times\gamma + \beta\times\gamma,$$

$$[\alpha + \beta].[\gamma + \delta] = \alpha.\gamma + \alpha.\delta + \beta.\gamma + \beta.\delta,$$

$$[\alpha + \beta]\times[\gamma + \delta] = \alpha\times\gamma + \alpha\times\delta + \beta\times\gamma + \beta\times\delta;$$

and, in general, direct and skew products of sums of vectors may be 
expanded precisely as the products of sums in algebra, except that 
in skew products the order of the factors must not be changed 
without compensation in the sign of the term. If any of the terms 
in the factors have negative signs, the signs of the expanded product 
(when there is no change in the order of the factors) will be 
determined by the same rules as in algebra. It is on account of this 
analogy with algebraic products that these functions of vectors are 
called products and that other terms relating to multiplication are 
applied to them. 

21. Numerical calculation of direct and skew products. The 
properties demonstrated in the last two paragraphs (which may be 
briefly expressed by saying that the operations of direct and skew 
multiplication are distributive) afford the rule for the numerical 
calculation of a direct product, or of the components of a skew 
product, when the rectangular components of the factors are given 
numerically. In fact, if 

$$\alpha = xi + yj + zk, \quad\text{and}\quad \beta = x'i + y'j + z'k,$$

$$\alpha.\beta = xx' + yy' + zz',$$

and 

$$\alpha\times\beta = (yz' - zy')i + (zx' - xz')j + (xy' - yx')k.$$
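
The rule of this section translates directly into a few lines of code. The following is a minimal sketch, with hypothetical function names, in which a vector is represented by the triple of its rectangular components; the numerical values are illustrative only.

```python
def direct_product(u, v):
    """xx' + yy' + zz' for u = (x, y, z), v = (x', y', z')."""
    return u[0] * v[0] + u[1] * v[1] + u[2] * v[2]

def skew_product(u, v):
    """Components of the skew product along i, j, k."""
    x, y, z = u
    x1, y1, z1 = v
    return (y * z1 - z * y1, z * x1 - x * z1, x * y1 - y * x1)

# The normal system of unit vectors of No. 11.
i, j, k = (1, 0, 0), (0, 1, 0), (0, 0, 1)

alpha = (1, 2, 3)
beta = (4, 5, 6)
d = direct_product(alpha, beta)
s = skew_product(alpha, beta)
s_reversed = skew_product(beta, alpha)
```

For these components the direct product is 32 and the skew product is (-3, 6, -3); reversing the order of the factors changes the sign of every component, and applying the same function to i and j returns k.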

22. Representation of the area of a parallelogram by a skew 
product. It will be easily seen that $\alpha\times\beta$ represents in magnitude 
the area of the parallelogram of which $\alpha$ and $\beta$ (supposed drawn 
from a common origin) are the sides, and that it represents in 
direction the normal to the plane of the parallelogram on the side on 
which the rotation from $\alpha$ toward $\beta$ appears counter-clockwise. 

23. Representation of the volume of a parallelopiped by a triple 
product. It will also be seen that $\alpha\times\beta.\gamma$* represents in numerical 
value the volume of the parallelopiped of which $\alpha$, $\beta$, and $\gamma$ (supposed 
drawn from a common origin) are the edges, and that the value of 
the expression is positive or negative according as $\gamma$ lies on the side 
of the plane of $\alpha$ and $\beta$ on which the rotation from $\alpha$ to $\beta$ appears 
counter-clockwise, or on the opposite side. 
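
Both representations can be tried on a simple figure. The sketch below is illustrative only: the vectors are hypothetical, given by rectangular components, and the helper names are not from the text.

```python
import math

def dot(u, v):
    return sum(p * q for p, q in zip(u, v))

def cross(u, v):
    return (u[1] * v[2] - u[2] * v[1],
            u[2] * v[0] - u[0] * v[2],
            u[0] * v[1] - u[1] * v[0])

# Parallelogram in the horizontal plane: base 2, height 3.
alpha = (2, 0, 0)
beta = (1, 3, 0)
normal = cross(alpha, beta)
area = math.sqrt(dot(normal, normal))

# Parallelopiped with a third edge above, and then below, the plane of alpha and beta.
gamma_above = (1, 1, 4)
gamma_below = (1, 1, -4)
volume_above = dot(cross(alpha, beta), gamma_above)
volume_below = dot(cross(alpha, beta), gamma_below)
```

The area is base times height, the skew product points upward (the side from which the rotation from the first side to the second appears counter-clockwise), and the triple product is the volume with a sign that follows the side on which the third edge lies.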

24. Hence, 

$$\alpha\times\beta.\gamma = \beta\times\gamma.\alpha = \gamma\times\alpha.\beta = \gamma.\alpha\times\beta = \alpha.\beta\times\gamma$$
$$= \beta.\gamma\times\alpha = -\beta\times\alpha.\gamma = -\gamma\times\beta.\alpha = -\alpha\times\gamma.\beta$$
$$= -\gamma.\beta\times\alpha = -\alpha.\gamma\times\beta = -\beta.\alpha\times\gamma.$$

It will be observed that all the products of this type, which can be 
made with three given vectors, are the same in numerical value, and 
that any two such products are of the same or opposite character in 
respect to sign, according as the cyclic order of the letters is the same 
or different. The product vanishes when two of the vectors are 
parallel to the same line, or when the three are parallel to the same 
plane. 

* Since the sign $\times$ is only used between vectors, the skew multiplication in expressions 
of this kind is evidently to be performed first. In other words, the above expression 
must be interpreted as $[\alpha\times\beta].\gamma$. 

This kind of product may be called the scalar product of the three 
vectors. There are two other kinds of products of three vectors, both 
of which are vectors, viz., products of the type $(\alpha.\beta)\gamma$ or $\gamma(\alpha.\beta)$, and 
products of the type $\alpha\times[\beta\times\gamma]$ or $[\gamma\times\beta]\times\alpha$. 

25. $i.j\times k = j.k\times i = k.i\times j = 1, \qquad i.k\times j = k.j\times i = j.i\times k = -1.$ 

From these equations, which follow immediately from those of 
No. 17, the propositions of the last section might have been derived, 
viz., by substituting for $\alpha$, $\beta$, and $\gamma$, respectively, expressions of the 
form $xi + yj + zk$, $x'i + y'j + z'k$, and $x''i + y''j + z''k$.* Such a method, 
which may be called expansion in terms of i, j, and k, will on many 
occasions afford very simple, although perhaps lengthy, demonstrations. 

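26. Triple products containing only two different letters. The 
significance and the relations of $(\alpha.\alpha)\beta$, $(\alpha.\beta)\alpha$, and $\alpha\times[\alpha\times\beta]$ will 
be most evident, if we consider $\beta$ as made up of two components, 
$\beta'$ and $\beta''$, respectively parallel and perpendicular to $\alpha$. Then 

$$(\alpha.\beta)\alpha = (\alpha.\beta')\alpha = (\alpha.\alpha)\beta',$$
$$\alpha\times[\alpha\times\beta] = \alpha\times[\alpha\times\beta''] = -(\alpha.\alpha)\beta''.$$

Hence, 

$$\alpha\times[\alpha\times\beta] = (\alpha.\beta)\alpha - (\alpha.\alpha)\beta.$$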

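27. General relation of the vector products of three factors.—In the
triple product $\alpha\times[\beta\times\gamma]$ we may set

$$\alpha = l\beta + m\gamma + n\,\beta\times\gamma,$$

unless $\beta$ and $\gamma$ have the same direction. Then

$$\begin{aligned}
\alpha\times[\beta\times\gamma] &= l\beta\times[\beta\times\gamma] + m\gamma\times[\beta\times\gamma] \\
&= l(\beta.\gamma)\beta - l(\beta.\beta)\gamma - m(\gamma.\beta)\gamma + m(\gamma.\gamma)\beta \\
&= (l\beta.\gamma + m\gamma.\gamma)\beta - (l\beta.\beta + m\gamma.\beta)\gamma.
\end{aligned}$$

But $l\beta.\gamma + m\gamma.\gamma = \alpha.\gamma$, and $l\beta.\beta + m\gamma.\beta = \alpha.\beta$.

Therefore $$\alpha\times[\beta\times\gamma] = (\alpha.\gamma)\beta - (\alpha.\beta)\gamma,$$

which is evidently true, when $\beta$ and $\gamma$ have the same directions. It
may also be written

$$[\gamma\times\beta]\times\alpha = \beta(\gamma.\alpha) - \gamma(\beta.\alpha).$$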
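
The general relation of No. 27 is easily checked on particular vectors. The sketch below is only a numerical illustration under the assumption that vectors are triples of integer components; the helper names and the sample values are not from the text.

```python
# Illustrative check of a x [b x c] = (a.c)b - (a.b)c on arbitrary integer vectors.

def dot(a, b):
    return a[0]*b[0] + a[1]*b[1] + a[2]*b[2]

def cross(a, b):
    return (a[1]*b[2] - a[2]*b[1],
            a[2]*b[0] - a[0]*b[2],
            a[0]*b[1] - a[1]*b[0])

def scale(n, a):
    """Product of the scalar n and the vector a."""
    return tuple(n*x for x in a)

def sub(a, b):
    """Geometrical difference a - b."""
    return tuple(x - y for x, y in zip(a, b))

alpha, beta, gamma = (1, 2, 3), (-2, 0, 5), (4, 1, -1)

lhs = cross(alpha, cross(beta, gamma))
rhs = sub(scale(dot(alpha, gamma), beta), scale(dot(alpha, beta), gamma))
print(lhs, rhs)  # the two members agree

# The second form, [c x b] x a = b(c.a) - c(b.a), is the same statement.
lhs2 = cross(cross(gamma, beta), alpha)
print(lhs2 == rhs)
```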

* The student who is familiar with the nature of determinants will not fail to observe
that the triple product $\alpha.\beta\times\gamma$ is the determinant formed by the nine rectangular
components of $\alpha$, $\beta$, and $\gamma$, nor that the rectangular components of $\alpha\times\beta$ are determinants
of the second order formed from the components of $\alpha$ and $\beta$. (See the last equation of
No. 21.)

28. This principle may be used in the transformation of more
complex products. It will be observed that its application will
always simultaneously eliminate, or introduce, two signs of skew
multiplication.

The student will easily prove the following identical equations, 
which, although of considerable importance, are here given principally 
as exercises in the application of the preceding formulas. 

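29. $$\alpha\times[\beta\times\gamma] + \beta\times[\gamma\times\alpha] + \gamma\times[\alpha\times\beta] = 0.$$

30. $$[\alpha\times\beta].[\gamma\times\delta] = (\alpha.\gamma)(\beta.\delta) - (\alpha.\delta)(\beta.\gamma).$$

31. $$\begin{aligned}
[\alpha\times\beta]\times[\gamma\times\delta] &= (\alpha.\gamma\times\delta)\beta - (\beta.\gamma\times\delta)\alpha \\
&= (\alpha.\beta\times\delta)\gamma - (\alpha.\beta\times\gamma)\delta.
\end{aligned}$$

32. $$\begin{aligned}
\alpha\times[\beta\times[\gamma\times\delta]] &= (\alpha.\gamma\times\delta)\beta - (\alpha.\beta)\gamma\times\delta \\
&= (\beta.\delta)\alpha\times\gamma - (\beta.\gamma)\alpha\times\delta.
\end{aligned}$$

33. $$\begin{aligned}
[\alpha\times\beta].[\gamma\times\delta]\times[\epsilon\times\zeta] &= (\alpha.\beta\times\delta)(\gamma.\epsilon\times\zeta) - (\alpha.\beta\times\gamma)(\delta.\epsilon\times\zeta) \\
&= (\alpha.\beta\times\epsilon)(\zeta.\gamma\times\delta) - (\alpha.\beta\times\zeta)(\epsilon.\gamma\times\delta) \\
&= (\gamma.\delta\times\alpha)(\beta.\epsilon\times\zeta) - (\gamma.\delta\times\beta)(\alpha.\epsilon\times\zeta).
\end{aligned}$$

34. $$[\alpha\times\beta].[\beta\times\gamma]\times[\gamma\times\alpha] = (\alpha.\beta\times\gamma)^2.$$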
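
Before attempting the proofs, the student may wish to test some of these identities on particular vectors. The following sketch checks Nos. 29, 30, 31 (first form), and 34 with integer components, so that the comparisons are exact; the vectors and the helper names are arbitrary assumptions, and a numerical check of this kind is of course no substitute for the demonstration.

```python
# Illustrative numerical test of the identities of Nos. 29, 30, 31 and 34.

def dot(a, b):
    return a[0]*b[0] + a[1]*b[1] + a[2]*b[2]

def cross(a, b):
    return (a[1]*b[2] - a[2]*b[1],
            a[2]*b[0] - a[0]*b[2],
            a[0]*b[1] - a[1]*b[0])

def add(*vectors):
    """Geometrical sum of any number of vectors."""
    return tuple(sum(parts) for parts in zip(*vectors))

def scale(n, a):
    return tuple(n*x for x in a)

def box(a, b, c):
    """a . b x c"""
    return dot(a, cross(b, c))

a, b, c, d = (1, 2, 3), (-2, 0, 5), (4, 1, -1), (3, -7, 2)

checks = {
    "No. 29": add(cross(a, cross(b, c)),
                  cross(b, cross(c, a)),
                  cross(c, cross(a, b))) == (0, 0, 0),
    "No. 30": dot(cross(a, b), cross(c, d))
              == dot(a, c)*dot(b, d) - dot(a, d)*dot(b, c),
    "No. 31": cross(cross(a, b), cross(c, d))
              == add(scale(box(a, c, d), b), scale(-box(b, c, d), a)),
    "No. 34": dot(cross(a, b), cross(cross(b, c), cross(c, a)))
              == box(a, b, c)**2,
}
for number, holds in checks.items():
    print(number, holds)
```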

35. The student will also easily convince himself that a product
formed of any number of letters (representing vectors) combined in
any possible way by scalar, direct, and skew multiplications may be
reduced by the principles of Nos. 24 and 27 to a sum of products,
each of which consists of scalar factors of the forms $\alpha.\beta$ and $\alpha.\beta\times\gamma$,
with a single vector factor of the form $\alpha$ or $\alpha\times\beta$, when the original
product is a vector.

36. Elimination of scalars from vector equations.—It has already
been observed that the elimination of vectors from equations of the
form

$$a\alpha + b\beta + c\gamma + d\delta + \text{etc.} = 0$$

is performed by the same rule as the eliminations of ordinary algebra.
(See No. 9.) But the elimination of scalars from such equations is at
least formally different. Since a single vector equation is the equivalent
of three scalar equations, we must be able to deduce from such
an equation a scalar equation from which two of the scalars which
appear in the original vector equation have been eliminated. We
shall see how this may be done, if we consider the scalar equation

$$a\alpha.\lambda + b\beta.\lambda + c\gamma.\lambda + d\delta.\lambda + \text{etc.} = 0,$$

which is derived from the above vector equation by direct multiplication
by a vector $\lambda$. We may regard the original equation as the
equivalent of the three scalar equations obtained by substituting for
$\alpha$, $\beta$, $\gamma$, $\delta$, etc., their X-, Y-, and Z-components. The second equation
would be derived from these by multiplying them respectively by
the X-, Y-, and Z-components of $\lambda$ and adding. Hence the second
equation may be regarded as the most general form of a scalar equation
of the first degree in $a$, $b$, $c$, $d$, etc., which can be derived from
the original vector equation or its equivalent three scalar equations.
If we wish to have two of the scalars, as $b$ and $c$, disappear, we have
only to choose for $\lambda$ a vector perpendicular to $\beta$ and $\gamma$. Such a
vector is $\beta\times\gamma$. We thus obtain

$$a\alpha.\beta\times\gamma + d\delta.\beta\times\gamma + \text{etc.} = 0.$$

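37. Relations of four vectors.—By this method of elimination we
may find the values of the coefficients $a$, $b$, and $c$ in the equation

$$\rho = a\alpha + b\beta + c\gamma, \tag{1}$$

by which any vector $\rho$ is expressed in terms of three others. (See
No. 10.) If we multiply directly by $\beta\times\gamma$, $\gamma\times\alpha$, and $\alpha\times\beta$, we obtain

$$\rho.\beta\times\gamma = a\,\alpha.\beta\times\gamma, \quad \rho.\gamma\times\alpha = b\,\beta.\gamma\times\alpha, \quad \rho.\alpha\times\beta = c\,\gamma.\alpha\times\beta; \tag{2}$$

whence

$$a = \frac{\rho.\beta\times\gamma}{\alpha.\beta\times\gamma}, \quad b = \frac{\rho.\gamma\times\alpha}{\alpha.\beta\times\gamma}, \quad c = \frac{\rho.\alpha\times\beta}{\alpha.\beta\times\gamma}. \tag{3}$$

By substitution of these values, we obtain the identical equation,

$$(\alpha.\beta\times\gamma)\rho = (\rho.\beta\times\gamma)\alpha + (\rho.\gamma\times\alpha)\beta + (\rho.\alpha\times\beta)\gamma. \tag{4}$$

(Compare No. 31.) If we wish the four vectors to appear symmetrically
in the equation we may write

$$(\alpha.\beta\times\gamma)\rho - (\beta.\gamma\times\rho)\alpha + (\gamma.\rho\times\alpha)\beta - (\rho.\alpha\times\beta)\gamma = 0. \tag{5}$$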

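If we wish to express $\rho$ as a sum of vectors having directions
perpendicular to the planes of $\alpha$ and $\beta$, of $\beta$ and $\gamma$, and of $\gamma$ and $\alpha$,
we may write

$$\rho = e\,\beta\times\gamma + f\,\gamma\times\alpha + g\,\alpha\times\beta. \tag{6}$$

To obtain the values of $e$, $f$, $g$, we multiply directly by $\alpha$, by $\beta$, and
by $\gamma$. This gives

$$e = \frac{\rho.\alpha}{\alpha.\beta\times\gamma}, \quad f = \frac{\rho.\beta}{\beta.\gamma\times\alpha}, \quad g = \frac{\rho.\gamma}{\gamma.\alpha\times\beta}. \tag{7}$$

Substituting these values we obtain the identical equation

$$(\alpha.\beta\times\gamma)\rho = (\rho.\alpha)\beta\times\gamma + (\rho.\beta)\gamma\times\alpha + (\rho.\gamma)\alpha\times\beta. \tag{8}$$

(Compare No. 32.)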

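38. Reciprocal systems of vectors.—The results of the preceding
section may be more compactly expressed if we use the abbreviations

$$\alpha' = \frac{\beta\times\gamma}{\alpha.\beta\times\gamma}, \quad \beta' = \frac{\gamma\times\alpha}{\beta.\gamma\times\alpha}, \quad \gamma' = \frac{\alpha\times\beta}{\gamma.\alpha\times\beta}. \tag{1}$$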
The identical equations (4) and (8) of the preceding number thus
become

$$\rho = (\rho.\alpha')\alpha + (\rho.\beta')\beta + (\rho.\gamma')\gamma, \tag{2}$$
$$\rho = (\rho.\alpha)\alpha' + (\rho.\beta)\beta' + (\rho.\gamma)\gamma'. \tag{3}$$

We may infer from the similarity of these equations that the relations
of $\alpha$, $\beta$, $\gamma$, and $\alpha'$, $\beta'$, $\gamma'$ are reciprocal, a proposition which is easily
proved directly. For the equations

$$\alpha = \frac{\beta'\times\gamma'}{\alpha'.\beta'\times\gamma'}, \quad \beta = \frac{\gamma'\times\alpha'}{\beta'.\gamma'\times\alpha'}, \quad \gamma = \frac{\alpha'\times\beta'}{\gamma'.\alpha'\times\beta'} \tag{4}$$

are satisfied identically by the substitution of the values of $\alpha'$, $\beta'$,
and $\gamma'$ given in equations (1). (See Nos. 31 and 34.)

Def.—It will be convenient to use the term reciprocal to designate
these relations, i.e., we shall say that three vectors are reciprocals of
three others, when they satisfy relations similar to those expressed in
equations (1) or (4).

With this understanding we may say:—

The coefficients by which any vector is expressed in terms of
three other vectors are the direct products of that vector with the
reciprocals of the three.

Among other relations which are satisfied by reciprocal systems
of vectors are the following:

$$\alpha.\alpha' = \beta.\beta' = \gamma.\gamma' = 1,$$
$$\alpha.\beta' = 0, \quad \alpha.\gamma' = 0, \quad \beta.\alpha' = 0, \quad \beta.\gamma' = 0, \quad \gamma.\alpha' = 0, \quad \gamma.\beta' = 0. \tag{5}$$

These nine equations may be regarded as defining the relations
between $\alpha$, $\beta$, $\gamma$ and $\alpha'$, $\beta'$, $\gamma'$ as reciprocals.

$$(\alpha.\beta\times\gamma)(\alpha'.\beta'\times\gamma') = 1. \tag{6}$$

(See No. 34.)

$$\alpha\times\alpha' + \beta\times\beta' + \gamma\times\gamma' = 0. \tag{7}$$

(See No. 29.)

A system of three mutually perpendicular unit vectors is reciprocal 
to itself, and only such a system. 

The identical equation 

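$$\rho = (\rho.i)i + (\rho.j)j + (\rho.k)k \tag{8}$$

may be regarded as a particular case of equation (2).

The system reciprocal to $\alpha\times\beta$, $\beta\times\gamma$, $\gamma\times\alpha$ is

$$\alpha'\times\beta', \quad \beta'\times\gamma', \quad \gamma'\times\alpha',$$

or

$$\frac{\gamma}{\alpha.\beta\times\gamma}, \quad \frac{\alpha}{\alpha.\beta\times\gamma}, \quad \frac{\beta}{\alpha.\beta\times\gamma}.$$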
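
The use of reciprocal systems may be illustrated by a short computation. The sketch below forms the reciprocals by equations (1) of No. 38 and obtains the coefficients of a vector as its direct products with them; exact rational arithmetic is used so that the relations hold without rounding. The function names `reciprocal_system` and `coefficients`, and the sample vectors, are assumptions made only for this illustration.

```python
# Illustrative sketch of reciprocal systems (No. 38) with exact fractions.
from fractions import Fraction

def dot(a, b):
    return a[0]*b[0] + a[1]*b[1] + a[2]*b[2]

def cross(a, b):
    return (a[1]*b[2] - a[2]*b[1],
            a[2]*b[0] - a[0]*b[2],
            a[0]*b[1] - a[1]*b[0])

def box(a, b, c):
    """a . b x c"""
    return dot(a, cross(b, c))

def reciprocal_system(a, b, c):
    """Return (a', b', c') as defined by equations (1) of No. 38."""
    a, b, c = (tuple(Fraction(x) for x in v) for v in (a, b, c))
    v = box(a, b, c)
    if v == 0:
        raise ValueError("the three vectors are parallel to the same plane")
    return (tuple(x / v for x in cross(b, c)),
            tuple(x / v for x in cross(c, a)),
            tuple(x / v for x in cross(a, b)))

def coefficients(rho, a, b, c):
    """Coefficients of rho in terms of a, b, c, by equation (2) of No. 38."""
    ra, rb, rc = reciprocal_system(a, b, c)
    return (dot(rho, ra), dot(rho, rb), dot(rho, rc))

alpha, beta, gamma = (1, 2, 3), (-2, 0, 5), (4, 1, -1)
rho = (7, -3, 2)

ca, cb, cc = coefficients(rho, alpha, beta, gamma)
rebuilt = tuple(ca*x + cb*y + cc*z for x, y, z in zip(alpha, beta, gamma))
print(ca, cb, cc)
print(rebuilt == rho)  # rho is recovered from its three coefficients
```

Applying `reciprocal_system` to the reciprocals returns the original three vectors, which is the reciprocity asserted by equations (4).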

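38a. If we multiply the identical equation (8) of No. 37 by $\sigma\times\tau$,
we obtain the equation

$$\begin{aligned}
(\alpha.\beta\times\gamma)(\rho.\sigma\times\tau) = {}& \alpha.\rho\,(\beta.\sigma\;\gamma.\tau - \beta.\tau\;\gamma.\sigma) \\
&+ \beta.\rho\,(\gamma.\sigma\;\alpha.\tau - \gamma.\tau\;\alpha.\sigma) + \gamma.\rho\,(\alpha.\sigma\;\beta.\tau - \alpha.\tau\;\beta.\sigma),
\end{aligned}$$

which is therefore identical. But this equation cannot subsist identically,
unless

$$\begin{aligned}
(\alpha.\beta\times\gamma)\,\sigma\times\tau = {}& \alpha(\beta.\sigma\;\gamma.\tau - \beta.\tau\;\gamma.\sigma) \\
&+ \beta(\gamma.\sigma\;\alpha.\tau - \gamma.\tau\;\alpha.\sigma) + \gamma(\alpha.\sigma\;\beta.\tau - \alpha.\tau\;\beta.\sigma)
\end{aligned}$$

is also an identical equation. (The reader will observe that in each
of these equations the second member may be expressed as a determinant.)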

From these transformations, with those already given, it follows
that a product formed of any number of letters (representing vectors
and scalars), combined in any possible way by scalar, direct, and skew
multiplications, may be reduced to a sum of products, containing
each the sign $\times$ once and only once, when the original product
contains it an odd number of times, or entirely free from the sign,
when the original product contains it an even number of times.

39. Scalar equations of the first degree with respect to an unknown
vector.—It is easily shown that any scalar equation of the first
degree with respect to an unknown vector $\rho$, in which all the other
quantities are known, may be reduced to the form

$$\rho.\alpha = a,$$

in which $\alpha$ and $a$ are known. (See No. 35.) Three such equations
will afford the value of $\rho$ (by equation (8) of No. 37, or equation (3)
of No. 38), which may be used to eliminate $\rho$ from any other equation
either scalar or vector.

When we have four scalar equations of the first degree with
respect to $\rho$, the elimination may be performed most symmetrically
by substituting the values of $\rho.\alpha$, etc., in the equation

$$(\rho.\alpha)(\beta.\gamma\times\delta) - (\rho.\beta)(\gamma.\delta\times\alpha) + (\rho.\gamma)(\delta.\alpha\times\beta) - (\rho.\delta)(\alpha.\beta\times\gamma) = 0,$$

which is obtained from equation (8) of No. 37 by multiplying
directly by $\delta$. It may also be obtained from equation (5) of No. 37
by writing $\delta$ for $\rho$, and then multiplying directly by $\rho$.

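40. Solution of a vector equation of the first degree with respect to
the unknown vector.—It is now easy to solve an equation of the
form

$$\delta = \alpha(\lambda.\rho) + \beta(\mu.\rho) + \gamma(\nu.\rho), \tag{1}$$

where $\alpha$, $\beta$, $\gamma$, $\delta$, $\lambda$, $\mu$, and $\nu$ represent known vectors. Multiplying
directly by $\beta\times\gamma$, by $\gamma\times\alpha$, and by $\alpha\times\beta$, we obtain

$$\beta.\gamma\times\delta = (\beta.\gamma\times\alpha)(\lambda.\rho), \quad \gamma.\alpha\times\delta = (\gamma.\alpha\times\beta)(\mu.\rho), \quad \alpha.\beta\times\delta = (\alpha.\beta\times\gamma)(\nu.\rho);$$

or $$\alpha'.\delta = \lambda.\rho, \quad \beta'.\delta = \mu.\rho, \quad \gamma'.\delta = \nu.\rho,$$

where $\alpha'$, $\beta'$, $\gamma'$ are the reciprocals of $\alpha$, $\beta$, $\gamma$. Substituting these
values in the identical equation

$$\rho = \lambda'(\lambda.\rho) + \mu'(\mu.\rho) + \nu'(\nu.\rho),$$

in which $\lambda'$, $\mu'$, $\nu'$ are the reciprocals of $\lambda$, $\mu$, $\nu$ (see No. 38), we have

$$\rho = \lambda'(\alpha'.\delta) + \mu'(\beta'.\delta) + \nu'(\gamma'.\delta), \tag{2}$$

which is the solution required.

It results from the principle stated in No. 35, that any vector
equation of the first degree with respect to $\rho$ may be reduced to the
form

$$\delta = \alpha(\lambda.\rho) + \beta(\mu.\rho) + \gamma(\nu.\rho) + a\rho + \epsilon\times\rho.$$

But $$a\rho = a\lambda'(\lambda.\rho) + a\mu'(\mu.\rho) + a\nu'(\nu.\rho),$$

and $$\epsilon\times\rho = \epsilon\times\lambda'(\lambda.\rho) + \epsilon\times\mu'(\mu.\rho) + \epsilon\times\nu'(\nu.\rho),$$

where $\lambda'$, $\mu'$, $\nu'$ represent, as before, the reciprocals of $\lambda$, $\mu$, $\nu$. By
substitution of these values the equation is reduced to the form of
equation (1), which may therefore be regarded as the most general
form of a vector equation of the first degree with respect to $\rho$.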
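
The solution (2) of No. 40 can be followed step by step in a small computation. The sketch below builds the second member of equation (1) from a known vector and then recovers that vector by means of the two reciprocal systems; the name `solve_first_degree` and all the sample vectors are hypothetical, chosen only so that neither triple of vectors is parallel to one plane.

```python
# Illustrative sketch of equation (2) of No. 40, using exact fractions.
from fractions import Fraction

def dot(a, b):
    return a[0]*b[0] + a[1]*b[1] + a[2]*b[2]

def cross(a, b):
    return (a[1]*b[2] - a[2]*b[1],
            a[2]*b[0] - a[0]*b[2],
            a[0]*b[1] - a[1]*b[0])

def reciprocal_system(a, b, c):
    """Reciprocals of a, b, c by equations (1) of No. 38."""
    a, b, c = (tuple(Fraction(x) for x in v) for v in (a, b, c))
    v = dot(a, cross(b, c))
    if v == 0:
        raise ValueError("the three vectors are parallel to the same plane")
    return (tuple(x / v for x in cross(b, c)),
            tuple(x / v for x in cross(c, a)),
            tuple(x / v for x in cross(a, b)))

def solve_first_degree(alpha, beta, gamma, delta, lam, mu, nu):
    """Solve delta = alpha(lam.rho) + beta(mu.rho) + gamma(nu.rho) for rho."""
    ra, rb, rc = reciprocal_system(alpha, beta, gamma)
    rl, rm, rn = reciprocal_system(lam, mu, nu)
    # alpha'.delta, beta'.delta, gamma'.delta are the values of lam.rho, mu.rho, nu.rho
    x, y, z = dot(ra, delta), dot(rb, delta), dot(rc, delta)
    return tuple(x*p + y*q + z*r for p, q, r in zip(rl, rm, rn))

alpha, beta, gamma = (1, 2, 3), (-2, 0, 5), (4, 1, -1)
lam, mu, nu = (1, 0, 1), (0, 1, 1), (1, 1, 0)
rho_known = (2, -1, 3)

# Second member of equation (1), formed from the known rho.
delta = tuple(dot(lam, rho_known)*p + dot(mu, rho_known)*q + dot(nu, rho_known)*r
              for p, q, r in zip(alpha, beta, gamma))

rho = solve_first_degree(alpha, beta, gamma, delta, lam, mu, nu)
print(rho == rho_known)
```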

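41. Relations between two normal systems of unit vectors.—If
$i$, $j$, $k$, and $i'$, $j'$, $k'$ are two normal systems of unit vectors, we have

$$\left.\begin{aligned}
i' &= (i.i')i + (j.i')j + (k.i')k, \\
j' &= (i.j')i + (j.j')j + (k.j')k, \\
k' &= (i.k')i + (j.k')j + (k.k')k,
\end{aligned}\right\} \tag{1}$$

and

$$\left.\begin{aligned}
i &= (i.i')i' + (i.j')j' + (i.k')k', \\
j &= (j.i')i' + (j.j')j' + (j.k')k', \\
k &= (k.i')i' + (k.j')j' + (k.k')k'.
\end{aligned}\right\} \tag{2}$$

(See equation (8) of No. 38.)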

The nine coefficients in these equations are evidently the cosines
of the nine angles made by a vector of one system with a vector of
the other system. The principal relations of these cosines are easily
deduced. By direct multiplication of each of the preceding equations
with itself, we obtain six equations of the type

$$(i.i')^2 + (j.i')^2 + (k.i')^2 = 1. \tag{3}$$

By direct multiplication of equations (1) with each other, and of
equations (2) with each other, we obtain six of the type

$$(i.i')(i.j') + (j.i')(j.j') + (k.i')(k.j') = 0. \tag{4}$$

By skew multiplication of equations (1) with each other, we obtain
three of the type

$$\begin{aligned}
k' = {}& \{(j.i')(k.j') - (k.i')(j.j')\}i + \{(k.i')(i.j') - (i.i')(k.j')\}j \\
&+ \{(i.i')(j.j') - (j.i')(i.j')\}k.
\end{aligned}$$

Comparing these three equations with the original three, we obtain
nine of the type

$$i.k' = (j.i')(k.j') - (k.i')(j.j'). \tag{5}$$

Finally, if we equate the scalar product of the three right hand
members of (1) with that of the three left hand members, we obtain

$$\begin{aligned}
&(i.i')(j.j')(k.k') + (j.i')(k.j')(i.k') + (k.i')(i.j')(j.k') \\
&\quad - (k.i')(j.j')(i.k') - (k.j')(j.k')(i.i') - (k.k')(j.i')(i.j') = 1.
\end{aligned} \tag{6}$$
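
The relations (3) to (6) may be verified on any particular pair of normal systems. In the sketch below the second system is an assumed example with rational components, so that every relation can be tested exactly; the array `c` of nine cosines and the other names are illustrative only.

```python
# Illustrative check of relations (3)-(6) of No. 41 for one assumed pair of systems.
from fractions import Fraction as F

def dot(a, b):
    return a[0]*b[0] + a[1]*b[1] + a[2]*b[2]

# First system i, j, k and an assumed second system i', j', k' (rational components).
old = ((1, 0, 0), (0, 1, 0), (0, 0, 1))
new = tuple(tuple(F(x, 3) for x in v)
            for v in ((2, -1, 2), (2, 2, -1), (-1, 2, 2)))

# c[p][q] is the cosine (q-th old vector).(p-th new vector), e.g. c[0][1] = j.i'
c = [[dot(old[q], new[p]) for q in range(3)] for p in range(3)]

# Type (3): squares of the cosines in one equation of (1) or of (2) sum to unity.
eq3 = (all(sum(x*x for x in row) == 1 for row in c)
       and all(sum(c[p][q]**2 for p in range(3)) == 1 for q in range(3)))

# Type (4): direct products of different equations vanish.
eq4 = all(dot(c[p], c[q]) == 0 for p in range(3) for q in range(p + 1, 3))

# Type (5): i.k' = (j.i')(k.j') - (k.i')(j.j')
eq5 = c[2][0] == c[0][1]*c[1][2] - c[0][2]*c[1][1]

# Equation (6): the determinant of the nine cosines is unity.
det = (c[0][0]*c[1][1]*c[2][2] + c[0][1]*c[1][2]*c[2][0] + c[0][2]*c[1][0]*c[2][1]
       - c[0][2]*c[1][1]*c[2][0] - c[0][0]*c[1][2]*c[2][1] - c[0][1]*c[1][0]*c[2][2])

print(eq3, eq4, eq5, det)
```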

Equations (1) and (2) (if the expressions in the parentheses are
supposed replaced by numerical values) represent the linear relations
which subsist between one vector of one system and the three vectors
of the other system. If we desire to express the similar relations
which subsist between two vectors of one system and two of the other,
we may take the skew products of equations (1) with equations (2),
after transposing all terms in the latter. This will afford nine
equations of the type

$$(i.j')k' - (i.k')j' = (k.i')j - (j.i')k. \tag{7}$$

We may divide an equation by an indeterminate direct factor. [MS. note by author.] 


CHAPTER II.

CONCERNING THE DIFFERENTIAL AND INTEGRAL CALCULUS OF VECTORS. 

42. Differentials of vectors.—The differential of a vector is the
geometrical difference of two values of that vector which differ
infinitely little. It is itself a vector, and may make any angle with
the vector differentiated. It is expressed by the same sign ($d$) as the
differentials of ordinary analysis.

With reference to any fixed axes, the components of the differential
of a vector are manifestly equal to the differentials of the components
of the vector, i.e., if $\alpha$, $\beta$, and $\gamma$ are fixed unit vectors, and

$$\rho = x\alpha + y\beta + z\gamma,$$
$$d\rho = dx\,\alpha + dy\,\beta + dz\,\gamma.$$

43. Differential of a function of several variables. —The differential 
of a vector or scalar function of any number of vector or scalar 
variables is evidently the sum (geometrical or algebraic, according as 
the function is vector or scalar) of the differentials of the function 
due to the separate variation of the several variables. 

44. Differential of a product.—The differential of a product of any
kind due to the variation of a single factor is obtained by prefixing 
the sign of differentiation to that factor in the product. This is 
evidently true of differentials, since it will hold true even of finite 
differences. 

45. From these principles we obtain the following identical
equations:

$$d(\alpha + \beta) = d\alpha + d\beta, \tag{1}$$
$$d(n\alpha) = dn\,\alpha + n\,d\alpha, \tag{2}$$
$$d(\alpha.\beta) = d\alpha.\beta + \alpha.d\beta, \tag{3}$$
$$d[\alpha\times\beta] = d\alpha\times\beta + \alpha\times d\beta, \tag{4}$$
$$d(\alpha.\beta\times\gamma) = d\alpha.\beta\times\gamma + \alpha.d\beta\times\gamma + \alpha.\beta\times d\gamma, \tag{5}$$
$$d[(\alpha.\beta)\gamma] = (d\alpha.\beta)\gamma + (\alpha.d\beta)\gamma + (\alpha.\beta)d\gamma. \tag{6}$$

46. Differential coefficient with respect to a scalar.—The quotient
obtained by dividing the differential of a vector due to the variation
of any scalar of which it is a function by the differential of that
scalar is called the differential coefficient of the vector with respect
to the scalar, and is indicated in the same manner as the differential
coefficients of ordinary analysis.

If we suppose the quantities occurring in the six equations of the
last section to be functions of a scalar $t$, we may substitute $\frac{d}{dt}$ for $d$ in
those equations since this is only to divide all terms by the scalar $dt$.

47. Successive differentiations. —The differential coefficient of a 
vector with respect to a scalar is of course a finite vector, of which 
we may take the differential, or the differential coefficient with 
respect to the same or any other scalar. We thus obtain differential 
coefficients of the higher orders, which are indicated as in the scalar 
calculus. 

A few examples will serve for illustration. 

If $\rho$ is the vector drawn from a fixed origin to a moving point at
any time $t$, $\frac{d\rho}{dt}$ will be the vector representing the velocity of the
point, and $\frac{d^2\rho}{dt^2}$ the vector representing its acceleration.

If $\rho$ is the vector drawn from a fixed origin to any point on a
curve, and $s$ the distance of that point measured on the curve from
any fixed point, $\frac{d\rho}{ds}$ is a unit vector, tangent to the curve and having
the direction in which $s$ increases; $\frac{d^2\rho}{ds^2}$ is a vector directed from a
point on the curve to the center of curvature, and equal to the
curvature; $\frac{d\rho}{ds}\times\frac{d^2\rho}{ds^2}$ is the normal to the osculating plane, directed to
the side on which the curve appears described counter-clockwise
about the center of curvature, and equal to the curvature. The
tortuosity (or rate of rotation of the osculating plane, considered as
positive when the rotation appears counter-clockwise as seen from
the direction in which $s$ increases) is represented by

$$\frac{\dfrac{d\rho}{ds}.\dfrac{d^2\rho}{ds^2}\times\dfrac{d^3\rho}{ds^3}}{\dfrac{d^2\rho}{ds^2}.\dfrac{d^2\rho}{ds^2}}.$$
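
As an illustration of the last example, the expressions for curvature and tortuosity may be evaluated for a circular helix, a curve not mentioned in the text and assumed here only because its derivatives with respect to the arc are easily written down. With radius $a$ and rise $b$ per radian, and $c^2 = a^2 + b^2$, the known values are $a/c^2$ and $b/c^2$; the sketch below should reproduce them up to rounding.

```python
# Illustrative sketch: curvature and tortuosity of an assumed circular helix
# rho(s) = (a cos(s/c), a sin(s/c), b s/c), with c = sqrt(a^2 + b^2).
import math

def dot(a, b):
    return a[0]*b[0] + a[1]*b[1] + a[2]*b[2]

def cross(a, b):
    return (a[1]*b[2] - a[2]*b[1],
            a[2]*b[0] - a[0]*b[2],
            a[0]*b[1] - a[1]*b[0])

def helix_derivatives(s, a=3.0, b=4.0):
    """First, second and third differential coefficients of rho with respect to s."""
    c = math.hypot(a, b)
    t = s / c
    d1 = (-a/c*math.sin(t), a/c*math.cos(t), b/c)
    d2 = (-a/c**2*math.cos(t), -a/c**2*math.sin(t), 0.0)
    d3 = (a/c**3*math.sin(t), -a/c**3*math.cos(t), 0.0)
    return d1, d2, d3

d1, d2, d3 = helix_derivatives(1.7)

tangent_length = math.sqrt(dot(d1, d1))          # unity, since s is the arc
curvature = math.sqrt(dot(d2, d2))               # magnitude of d2
normal_length = math.sqrt(dot(cross(d1, d2), cross(d1, d2)))  # also the curvature
tortuosity = dot(d1, cross(d2, d3)) / dot(d2, d2)

print(tangent_length, curvature, normal_length, tortuosity)
```

For the default values $a = 3$, $b = 4$ the curvature is $3/25$ and the tortuosity $4/25$, whatever value of $s$ is chosen.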


48. Integration of an equation between differentials.—If $t$ and $u$
are two single-valued continuous scalar functions of any number of
scalar or vector variables, and

$$dt = du,$$

then $$t = u + a,$$

where $a$ is a scalar constant.

Or, if $\tau$ and $\omega$ are two single-valued continuous vector functions
of any number of scalar or vector variables, and

$$d\tau = d\omega,$$

then $$\tau = \omega + \alpha,$$

where $\alpha$ is a vector constant.

When the above hypotheses are not satisfied in general, but will be 
satisfied if the variations of the independent variables are confined 
within certain limits, then the conclusions will hold within those limits, 
provided that we can pass by continuous variation of the independent 
variables from any values within the limits to any other values 
within them, without transgressing the limits. 

49. So far, it will be observed, all operations have been entirely 
analogous to those of the ordinary calculus. 

Functions of Position in Space.

50. Def.—If $u$ is any scalar function of position in space (i.e., any
scalar quantity having continuously varying values in space), $\nabla u$ is
the vector function of position in space which has everywhere the
direction of the most rapid increase of $u$, and a magnitude equal to
the rate of that increase per unit of length. $\nabla u$ may be called the
derivative of $u$, and $u$ the primitive of $\nabla u$.

We may also take any one of the Nos. 51, 52, 53 for the definition
of $\nabla u$.


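51. If $\rho$ is the vector defining the position of a point in space,

$$du = \nabla u.d\rho.$$

52. $$\nabla u = i\frac{du}{dx} + j\frac{du}{dy} + k\frac{du}{dz}.$$

53. $$\frac{du}{dx} = i.\nabla u, \quad \frac{du}{dy} = j.\nabla u, \quad \frac{du}{dz} = k.\nabla u.$$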


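54. Def.—If $\omega$ is a vector having continuously varying values in space,

$$\nabla.\omega = i.\frac{d\omega}{dx} + j.\frac{d\omega}{dy} + k.\frac{d\omega}{dz}, \tag{1}$$

and

$$\nabla\times\omega = i\times\frac{d\omega}{dx} + j\times\frac{d\omega}{dy} + k\times\frac{d\omega}{dz}. \tag{2}$$

$\nabla.\omega$ is called the divergence of $\omega$ and $\nabla\times\omega$ its curl.

If we set

$$\omega = Xi + Yj + Zk,$$

we obtain by substitution the equations

$$\nabla.\omega = \frac{dX}{dx} + \frac{dY}{dy} + \frac{dZ}{dz}$$

and

$$\nabla\times\omega = i\left(\frac{dZ}{dy} - \frac{dY}{dz}\right) + j\left(\frac{dX}{dz} - \frac{dZ}{dx}\right) + k\left(\frac{dY}{dx} - \frac{dX}{dy}\right),$$

which may also be regarded as defining $\nabla.\omega$ and $\nabla\times\omega$.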
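
Definitions (1) and (2) lend themselves to direct numerical trial. The sketch below approximates the differential coefficients of $\omega$ by central differences and then forms the direct and skew products with i, j, k exactly as the definitions prescribe. The field `omega`, the step `h`, and the function names are assumptions made for the illustration; for this field the component formulas give $\nabla.\omega = 2xy + z + 2xz$ and $\nabla\times\omega = -y\,i - z^2 j - x^2 k$.

```python
# Illustrative sketch: divergence and curl formed from definitions (1) and (2)
# of No. 54, with differential coefficients approximated by central differences.

def dot(a, b):
    return a[0]*b[0] + a[1]*b[1] + a[2]*b[2]

def cross(a, b):
    return (a[1]*b[2] - a[2]*b[1],
            a[2]*b[0] - a[0]*b[2],
            a[0]*b[1] - a[1]*b[0])

UNIT = ((1.0, 0.0, 0.0), (0.0, 1.0, 0.0), (0.0, 0.0, 1.0))  # i, j, k

def partial(field, point, axis, h=1e-5):
    """Approximate d(omega)/dx, d(omega)/dy or d(omega)/dz at the given point."""
    forward, backward = list(point), list(point)
    forward[axis] += h
    backward[axis] -= h
    f1, f0 = field(*forward), field(*backward)
    return tuple((u - v) / (2*h) for u, v in zip(f1, f0))

def divergence(field, point):
    # i.(d omega/dx) + j.(d omega/dy) + k.(d omega/dz)
    return sum(dot(UNIT[axis], partial(field, point, axis)) for axis in range(3))

def curl(field, point):
    # i x (d omega/dx) + j x (d omega/dy) + k x (d omega/dz)
    total = (0.0, 0.0, 0.0)
    for axis in range(3):
        term = cross(UNIT[axis], partial(field, point, axis))
        total = tuple(t + x for t, x in zip(total, term))
    return total

def omega(x, y, z):
    """An arbitrary sample field X i + Y j + Z k."""
    return (x*x*y, y*z, x*z*z)

point = (1.0, 2.0, 3.0)
div_value = divergence(omega, point)
curl_value = curl(omega, point)
print(div_value, curl_value)  # compare with 2xy + z + 2xz and (-y, -z^2, -x^2)
```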
55. Surface-integrals.—The integral $\iint\omega.d\sigma$, in which $d\sigma$ represents
an element of some surface, is called the surface-integral of $\omega$ for that
surface. It is understood here and elsewhere, when a vector is said
to represent a plane surface (or an element of surface which may be
regarded as plane), that the magnitude of the vector represents the
area of the surface, and that the direction of the vector represents
that of the normal drawn toward the positive side of the surface.
When the surface is defined as the boundary of a certain space, the
outside of the surface is regarded as positive.

The surface-integral of any given space (i.e., the surface-integral 
of the surface bounding that space) is evidently equal to the sum of 
the surface-integrals of all the parts into which the original space 
may be divided. For the integrals relating to the surfaces dividing 
the parts will evidently cancel in such a sum. 

The surface-integral of $\omega$ for a closed surface bounding a space $dv$
infinitely small in all its dimensions is

$$\nabla.\omega\,dv.$$

This follows immediately from the definition of $\nabla.\omega$, when the space
is a parallelepiped bounded by planes perpendicular to $i$, $j$, $k$. In
other cases, we may imagine the space (or rather a space nearly
coincident with the given space and of the same volume $dv$) to be
divided up into such parallelepipeds. The surface-integral for the
space made up of the parallelepipeds will be the sum of the
surface-integrals of all the parallelepipeds, and will therefore be expressed by
$\nabla.\omega\,dv$. The surface-integral of the original space will have sensibly
the same value, and will therefore be represented by the same
formula. It follows that the value of $\nabla.\omega$ does not depend upon the
system of unit vectors employed in its definition.

It is possible to attribute such a physical signification to the
quantities concerned in the above proposition, as shall make it evident
almost without demonstration. Let us suppose $\omega$ to represent a flux
of any substance. The rate of decrease of the density of that substance
at any point will be obtained by dividing the surface-integral
of the flux for any infinitely small closed surface about the point by
the volume enclosed. This quotient must therefore be independent of
the form of the surface. We may define $\nabla.\omega$ as representing that
quotient, and then obtain equation (1) of No. 54 by applying the
general principle to the case of the rectangular parallelopiped.

56. Skew surface-integrals.—The integral $\iint d\sigma\times\omega$ may be called
the skew surface-integral of $\omega$. It is evidently a vector. For a
closed surface bounding a space $dv$ infinitely small in all dimensions
this integral reduces to $\nabla\times\omega\,dv$, as is easily shown by reasoning like
that of No. 55.

57. Integration.—If $dv$ represents an element of any space, and $d\sigma$
an element of the bounding surface,

$$\iiint\nabla.\omega\,dv = \iint\omega.d\sigma.$$

For the first member of this equation represents the sum of the 
surface-integrals of all the elements of the given space. We may 
regard this principle as affording a means of integration, since we 
may use it to reduce a triple integral (of a certain form) to a double 
integral. 

The principle may also be expressed as follows: 

The surface-integral of any vector function of position in space for 
a closed surface is equal to the volume-integral of the divergence of 
that function for the space enclosed. 
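
The principle of No. 57 may be tried on a simple case. The sketch below takes the unit cube with edges along the axes and an assumed field whose divergence has been written out by hand, and compares the volume-integral of the divergence with the surface-integral over the six faces, the outside being regarded as positive. Both integrals are approximated by the midpoint rule; the field, the number of subdivisions, and the function names are illustrative assumptions.

```python
# Illustrative check of No. 57 on the unit cube 0 <= x, y, z <= 1.
# The field and its divergence are hypothetical examples chosen by hand.

def omega(x, y, z):
    return (x*x*y, y*z, x*z*z)

def div_omega(x, y, z):
    # dX/dx + dY/dy + dZ/dz for the field above
    return 2*x*y + z + 2*x*z

def volume_integral(n=20):
    """Midpoint-rule approximation of the volume-integral of the divergence."""
    h = 1.0 / n
    mids = [(m + 0.5) * h for m in range(n)]
    return sum(div_omega(x, y, z) for x in mids for y in mids for z in mids) * h**3

def surface_integral(n=20):
    """Midpoint-rule approximation of the surface-integral, outward normal positive."""
    h = 1.0 / n
    mids = [(m + 0.5) * h for m in range(n)]
    total = 0.0
    for u in mids:
        for v in mids:
            total += omega(1.0, u, v)[0] - omega(0.0, u, v)[0]  # faces x = 1 and x = 0
            total += omega(u, 1.0, v)[1] - omega(u, 0.0, v)[1]  # faces y = 1 and y = 0
            total += omega(u, v, 1.0)[2] - omega(u, v, 0.0)[2]  # faces z = 1 and z = 0
    return total * h * h

print(volume_integral(), surface_integral())  # the two values should agree
```

For this field both integrals have the exact value 3/2, and the midpoint rule happens to be exact here because every integrand is of the first degree in each variable.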

58. Line-integrals.—The integral $\int\omega.d\rho$, in which $d\rho$ denotes the
element of a line, is called the line-integral of $\omega$ for that line. It is
implied that one of the directions of the line is distinguished as 
positive. When the line is regarded as bounding a surface, that side 
of the surface will always be regarded as positive, on which the 
surface appears to be circumscribed counter-clockwise. 

59. Integration.—From No. 51 we obtain directly

$$\int\nabla u.d\rho = u'' - u',$$

where the single and double accents distinguish the values relating
to the beginning and end of the line.

In other words,—The line-integral of the derivative of any (continuous
and single-valued) scalar function of position in space is equal
to the difference of the values of the function at the extremities of
the line. For a closed line the integral vanishes.

60. Integration. —The following principle may be used to reduce 
double integrals of a certain form to simple integrals. 

If $d\sigma$ represents an element of any surface, and $d\rho$ an element of
the bounding line,

$$\iint\nabla\times\omega.d\sigma = \int\omega.d\rho.$$

In other words,—The line-integral of any vector function of position 
in space for a closed line is equal to the surface-integral of the curl of 
that function for any surface bounded by the line. 

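To prove this principle, we will consider the variation of the
line-integral which is due to a variation in the closed line for which
the integral is taken. We have, in the first place,

$$\delta\int\omega.d\rho = \int\delta\omega.d\rho + \int\omega.\delta d\rho.$$

But $$\omega.\delta d\rho = d(\omega.\delta\rho) - d\omega.\delta\rho.$$

Therefore, since $\int d(\omega.\delta\rho) = 0$ for a closed line,

$$\delta\int\omega.d\rho = \int\delta\omega.d\rho - \int d\omega.\delta\rho.$$

Now $$\delta\omega = \sum\left[\frac{d\omega}{dx}(i.\delta\rho)\right], \quad\text{and}\quad d\omega = \sum\left[\frac{d\omega}{dx}(i.d\rho)\right],$$

where the summation relates to the coordinate axes and connected
quantities. Substituting these values in the preceding equation,

$$\delta\int\omega.d\rho = \int\sum\left\{(i.\delta\rho)\left(\frac{d\omega}{dx}.d\rho\right) - (i.d\rho)\left(\frac{d\omega}{dx}.\delta\rho\right)\right\},$$

or by No. 30,

$$\delta\int\omega.d\rho = \int\sum\left[i\times\frac{d\omega}{dx}\right].[\delta\rho\times d\rho] = \int\nabla\times\omega.[\delta\rho\times d\rho].$$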


But $\delta\rho\times d\rho$ represents an element of the surface generated by the
motion of the element $d\rho$, and the last member of the equation is
the surface-integral of $\nabla\times\omega$ for the infinitesimal surface generated
by the motion of the whole line. Hence, if we conceive of a closed
curve passing gradually from an infinitesimal loop to any finite form,
the differential of the line-integral of $\omega$ for that curve will be equal
to the differential of the surface-integral of $\nabla\times\omega$ for the surface
generated: therefore, since both integrals commence with the value
zero, they must always be equal to each other. Such a mode of
generation will evidently apply to any surface closing any loop.
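
The principle of No. 60 can likewise be tried on a plane case. The sketch below takes the unit square in the plane of i and j, circumscribed counter-clockwise as seen from the side toward which k points, and compares the line-integral of an assumed field round the boundary with the surface-integral of the k-component of its curl. The field, its curl (written out by hand), and the function names are illustrative assumptions; both integrals use the midpoint rule.

```python
# Illustrative check of No. 60 on the unit square in the plane z = 0.
# The field omega and the k-component of its curl are hypothetical examples.

def dot(a, b):
    return a[0]*b[0] + a[1]*b[1] + a[2]*b[2]

def omega(x, y, z):
    return (-y*y, x*y, 0.0)

def curl_k(x, y):
    # k-component of the curl of omega: dY/dx - dX/dy = y + 2y
    return 3*y

def line_integral(field, corners, n=50):
    """Midpoint-rule line-integral round the closed polygon with the given corners."""
    total = 0.0
    for start, end in zip(corners, corners[1:] + corners[:1]):
        step = tuple((b - a) / n for a, b in zip(start, end))
        for m in range(n):
            mid = tuple(a + (m + 0.5) * s for a, s in zip(start, step))
            total += dot(field(*mid), step)
    return total

def surface_integral(n=50):
    """Midpoint-rule surface-integral of the curl, the normal being k."""
    h = 1.0 / n
    mids = [(m + 0.5) * h for m in range(n)]
    return sum(curl_k(x, y) for x in mids for y in mids) * h * h

# Corners taken in the counter-clockwise order seen from the positive side.
square = [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0), (1.0, 1.0, 0.0), (0.0, 1.0, 0.0)]

print(line_integral(omega, square), surface_integral())  # the two values should agree
```

Reversing the order of the corners changes the sign of the line-integral, in accordance with the convention of No. 58.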

61. The line-integral of $\omega$ for a closed line bounding a plane surface
$d\sigma$ infinitely small in all its dimensions is therefore

$$\nabla\times\omega.d\sigma.$$

This principle affords a definition of $\nabla\times\omega$ which is independent
of any reference to coordinate axes. If we imagine a circle described
about a fixed point to vary its orientation while keeping the same
size, there will be a certain position of the circle for which the
line-integral of $\omega$ will be a maximum, unless the line-integral vanishes for
all positions of the circle. The axis of the circle in this position,
drawn toward the side on which a positive motion in the circle
appears counter-clockwise, gives the direction of $\nabla\times\omega$, and the
quotient of the integral divided by the area of the circle gives the
magnitude of $\nabla\times\omega$.


$\nabla$, $\nabla.$, and $\nabla\times$ applied to Functions of Functions of Position.


62. A constant scalar factor after $\nabla$, $\nabla.$, or $\nabla\times$ may be placed
before the symbol.

63. If $f(u)$ denotes any scalar function of $u$, and $f'(u)$ the derived
function,

$$\nabla f(u) = f'(u)\nabla u.$$

64. If $u$ or $\omega$ is a function of several scalar or vector variables
which are themselves functions of the position of a single point, the
value of $\nabla u$ or $\nabla.\omega$ or $\nabla\times\omega$ will be equal to the sum of the values
obtained by making successively all but each one of these variables
constant.

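65. By the use of this principle we easily derive the following
identical equations:

$$\nabla[t + u] = \nabla t + \nabla u. \tag{1}$$
$$\nabla.[\tau + \omega] = \nabla.\tau + \nabla.\omega. \qquad \nabla\times[\tau + \omega] = \nabla\times\tau + \nabla\times\omega. \tag{2}$$
$$\nabla(tu) = u\nabla t + t\nabla u. \tag{3}$$
$$\nabla.(u\omega) = \omega.\nabla u + u\nabla.\omega. \tag{4}$$
$$\nabla\times[u\omega] = u\nabla\times\omega - \omega\times\nabla u. \tag{5}$$
$$\nabla.[\tau\times\omega] = \omega.\nabla\times\tau - \tau.\nabla\times\omega. \tag{6}$$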

The student will observe an analogy between these equations and
the formulae of multiplication. (In the last four equations the
analogy appears most distinctly when we regard all the factors but
one as constant.) Some of the more curious features of this analogy
are due to the fact that the $\nabla$ contains implicitly the vectors $i$, $j$,
and $k$, which are to be multiplied into the following quantities.

Combinations of the Operators $\nabla$, $\nabla.$, and $\nabla\times$.

66. If $u$ is any scalar function of position in space,

$$\nabla\times\nabla u = 0,$$

as may be derived directly from the definitions of these operators.

67. Conversely, if $\omega$ is such a vector function of position in space
that

$$\nabla\times\omega = 0,$$

$\omega$ is the derivative of a scalar function of position in space. This will
appear from the following considerations:

The line-integral $\int\omega.d\rho$ will vanish for any closed line, since it may
be expressed as the surface-integral of $\nabla\times\omega$. (No. 60.) The
line-integral taken from one given point $P'$ to another given point $P''$ is
independent of the line between the points for which the integral
is taken. (For, if two lines joining the same points gave different
values, by reversing one we should obtain a closed line for which the
integral would not vanish.) If we set $u$ equal to this line-integral,
supposing $P''$ to be variable and $P'$ to be constant in position, $u$ will
be a scalar function of the position of the point $P''$, satisfying the
condition $du = \omega.d\rho$, or, by No. 51, $\nabla u = \omega$. There will evidently be
an infinite number of functions satisfying this condition, which will
differ from one another by constant quantities.

If the region for which $\nabla\times\omega = 0$ is unlimited, these functions will
be single-valued. If the region is limited, but acyclic,* the functions
will still be single-valued and satisfy the condition $\nabla u = \omega$ within the
same region. If the region is cyclic, we may determine functions
satisfying the condition $\nabla u = \omega$ within the region, but they will not
necessarily be single-valued.

68. If $\omega$ is any vector function of position in space, $\nabla.\nabla\times\omega = 0$.
This may be deduced directly from the definitions of No. 54.

The converse of this proposition will be proved hereafter. 

69. If $u$ is any scalar function of position in space, we have by
Nos. 52 and 54

$$\nabla.\nabla u = \left(\frac{d^2}{dx^2} + \frac{d^2}{dy^2} + \frac{d^2}{dz^2}\right)u.$$

70. Def.—If $\omega$ is any vector function of position in space, we may
define $\nabla.\nabla\omega$ by the equation

$$\nabla.\nabla\omega = \left(\frac{d^2}{dx^2} + \frac{d^2}{dy^2} + \frac{d^2}{dz^2}\right)\omega,$$

the expression $\nabla.\nabla$ being regarded, for the present at least, as a single
operator when applied to a vector. (It will be remembered that no
meaning has been attributed to $\nabla$ before a vector.) It should be
noticed that if

$$\omega = iX + jY + kZ,$$

$$\nabla.\nabla\omega = i\,\nabla.\nabla X + j\,\nabla.\nabla Y + k\,\nabla.\nabla Z,$$

that is, the operator $\nabla.\nabla$ applied to a vector affects separately its
scalar components.

71. From the above definition with those of Nos. 52 and 54 we
may easily obtain

$$\nabla.\nabla\omega = \nabla\nabla.\omega - \nabla\times\nabla\times\omega.$$

The effect of the operator $\nabla.\nabla$ is therefore independent of the directions
of the axes used in its definition.

* If every closed line within a given region can contract to a single point without
breaking its continuity, or passing out of the region, the region is called acyclic, otherwise
cyclic.

A cyclic region may be made acyclic by diaphragms, which must then be regarded as
forming part of the surface bounding the region, each diaphragm contributing its own
area twice to that surface. This process may be used to reduce many-valued functions
of position in space, having single-valued derivatives, to single-valued functions.

When functions are mentioned or implied in the notation, the reader will always
understand single-valued functions, unless the contrary is distinctly intimated, or the
case is one in which the distinction is obviously immaterial. Diaphragms may be
applied to bring functions naturally many-valued under the application of some of the
following theorems, as Nos. 74 ff.

72. The expression $-\tfrac{1}{6}a^2\nabla.\nabla u$, where $a$ is any infinitesimal scalar,
evidently represents the excess of the value of the scalar function $u$
at the point considered above the average of its values at six points
at the following vector distances: $ai$, $-ai$, $aj$, $-aj$, $ak$, $-ak$. Since
the directions of $i$, $j$, and $k$ are immaterial (provided that they are
at right angles to each other), the excess of the value of $u$ at the
central point above its average value in a spherical surface of radius
$a$ constructed about that point as the center will be represented by
the same expression, $-\tfrac{1}{6}a^2\nabla.\nabla u$.

Precisely the same is true of a vector function, if it is understood
that the additions and subtractions implied in the terms average and
excess are geometrical additions and subtractions.

Maxwell has called $-\nabla.\nabla u$ the concentration of $u$, whether $u$ is
scalar or vector. We may call $\nabla.\nabla u$ (or $\nabla.\nabla\omega$), which is proportioned
to the excess of the average value of the function in an infinitesimal
spherical surface above the value at the center, the dispersion of
$u$ (or $\omega$).
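
The statement of No. 72 concerning the six points may be tried numerically. In the sketch below the function `u` is an assumed polynomial for which $\nabla.\nabla u = 12$ everywhere, so that the excess of the central value above the six-point average should be $-\tfrac{1}{6}a^2 \cdot 12 = -2a^2$; the function, the point, and the value of $a$ are arbitrary choices for the illustration.

```python
# Illustrative check of No. 72: excess of u at a point above the average of its
# values at the six points at vector distances ai, -ai, aj, -aj, ak, -ak.

def u(x, y, z):
    # Assumed sample function; its Laplacian is 2 + 4 + 6 = 12 everywhere.
    return x*x + 2*y*y + 3*z*z + x*y*z

def excess_over_six_points(f, point, a):
    x, y, z = point
    neighbours = [(x + a, y, z), (x - a, y, z),
                  (x, y + a, z), (x, y - a, z),
                  (x, y, z + a), (x, y, z - a)]
    average = sum(f(*q) for q in neighbours) / 6.0
    return f(*point) - average

a = 0.01
laplacian = 12.0
observed = excess_over_six_points(u, (1.0, 2.0, 3.0), a)
predicted = -a*a*laplacian / 6.0
print(observed, predicted)  # the two agree up to rounding
```

For a polynomial of so low a degree the agreement is exact apart from rounding; for a general function it holds only to the order of $a^2$, which is why the text requires $a$ to be infinitesimal.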

Transformation of Definite Integrals. 

73. From the equations of No. 65, with the principles of integration
of Nos. 57, 59, and 60, we may deduce various transformations of
definite integrals, which are entirely analogous to those known in the
scalar calculus under the name of integration by parts. The following
formulae (like those of Nos. 57, 59, and 60) are written for the
case of continuous values of the quantities (scalar and vector) to which
the signs $\nabla$, $\nabla.$, and $\nabla\times$ are applied. It is left to the student to
complete the formulae for cases of discontinuity in these values. The
manner in which this is to be done may in each case be inferred from
the nature of the formula itself. The most important discontinuities
of scalars are those which occur at surfaces: in the case of vectors
discontinuities at surfaces, at lines, and at points, should be considered.

74. From equation (3) we obtain

$$\int\nabla(tu).d\rho = t''u'' - t'u' = \int u\nabla t.d\rho + \int t\nabla u.d\rho,$$

where the accents distinguish the quantities relating to the limits of
the line-integrals. We are thus able to reduce a line-integral of the
form $\int u\nabla t.d\rho$ to the form $-\int t\nabla u.d\rho$ with quantities free from the
sign of integration.

75. From equation (5) we obtain 

ffV x{u(jo) .da-=f UAa.dp =^Jj uV xto.da— JJ wx Vu. d<T, 

where, as elsewhere in these equations, the line-integral relates to the 
boundary of the surface-integral. 

From this, by substitution of ∇t for ω, we may derive as a 
particular case 

∫∫∇u×∇t.dσ = ∫u∇t.dρ = −∫t∇u.dρ.

76. From equation (4) we obtain 

∫∫∫∇.[uω]dv = ∫∫uω.dσ = ∫∫∫ω.∇u dv + ∫∫∫u∇.ω dv,

where, as elsewhere in these equations, the surface-integral relates to 
the boundary of the volume-integrals. 

From this, by substitution of ∇t for ω, we derive as a particular case 

∫∫∫∇t.∇u dv = ∫∫u∇t.dσ − ∫∫∫u∇.∇t dv = ∫∫t∇u.dσ − ∫∫∫t∇.∇u dv,

which is Green's Theorem. The substitution of s∇t for ω gives the 
more general form of this theorem which is due to Thomson, viz., 

∫∫∫s∇t.∇u dv = ∫∫us∇t.dσ − ∫∫∫u∇.[s∇t]dv 
= ∫∫ts∇u.dσ − ∫∫∫t∇.[s∇u]dv. 

77. From equation (6) we obtain 

∫∫∫∇.[τ×ω]dv = ∫∫τ×ω.dσ = ∫∫∫ω.∇×τ dv − ∫∫∫τ.∇×ω dv.

A particular case is 

∫∫∫∇u.∇×ω dv = ∫∫ω×∇u.dσ.

Integration of Differential Equations. 

78. If throughout any continuous space (or in all space) 

∇u = 0,

then throughout the same space 

u = constant. 

79. If throughout any continuous space (or in all space) 

∇.∇u = 0,

and in any finite part of that space, or in any finite surface in or 
bounding it, 

∇u = 0,

then throughout the whole space 

∇u = 0, and u = constant. 

This will appear from the following considerations: 

If ∇u = 0 in any finite part of the space, u is constant in that part. 
If u is not constant throughout, let us imagine a sphere situated 
principally in the part in which u is constant, but projecting slightly 
into a part in which u has a greater value, or else into a part in 
which u has a less. The surface-integral of ∇u for the part of the 
spherical surface in the region where u is constant will have the 
value zero: for the other part of the surface, the integral will be 
either greater than zero, or less than zero. Therefore the whole 
surface-integral for the spherical surface will not have the value zero, 
which is required by the general condition, ∇.∇u = 0. 

Again, if ∇u = 0 only in a surface in or bounding the space in 
which ∇.∇u = 0, u will be constant in this surface, and the surface 
will be contiguous to a region in which ∇.∇u = 0 and u has a greater 
value than in the surface, or else a less value than in the surface. 
Let us imagine a sphere lying principally on the other side of the 
surface, but projecting slightly into this region, and let us particularly 
consider the surface-integral of ∇u for the small segment cut off by 
the surface ∇u = 0. The integral for that part of the surface of the 
segment which consists of part of the surface ∇u = 0 will have the 
value zero, the integral for the spherical part will have a value either 
greater than zero or else less than zero. Therefore the integral for 
the whole surface of the segment cannot have the value zero, which 
is demanded by the general condition, ∇.∇u = 0. 

80. If throughout a certain space (which need not be continuous, 
and which may extend to infinity) 

∇.∇u = 0,

and in all the bounding surfaces 

u = constant = a,

and (in case the space extends to infinity) if at infinite distances 
within the space u = a, then throughout the space 

∇u = 0, and u = a. 

For, if anywhere in the interior of the space ∇u has a value 
different from zero, we may find a point P where such is the case, 
and where u has a value b different from a; to fix our ideas we will 
say less. Imagine a surface enclosing all of the space in which u < b. 
(This must be possible, since that part of the space does not reach to 
infinity.) The surface-integral of ∇u for this surface has the value 
zero in virtue of the general condition ∇.∇u = 0. But, from the 
manner in which the surface is defined, no part of the integral can be 
negative. Therefore no part of the integral can be positive, and the 
supposition made with respect to the point P is untenable. That the 
supposition that b > a is untenable may be shown in a similar 
manner. Therefore the value of u is constant. 

This proposition may be generalized by substituting the condition 
∇.[t∇u] = 0 for ∇.∇u = 0, t denoting any positive (or any negative) 
scalar function of position in space. The conclusion would be the 
same, and the demonstration similar. 

81. If throughout a certain space (which need not be continuous, 
and which may extend to infinity) 

∇.∇u = 0,

and in all the bounding surfaces the normal component of ∇u vanishes, 
and at infinite distances within the space (if such there are) 

r² du/dr = 0,

where r denotes the distance from a fixed origin, then throughout the 
space 

∇u = 0,

and in each continuous portion of the same 

u = constant. 

For, if anywhere in the space in question ∇u has a value different 
from zero, let it have such a value at a point P, and let u be there 
equal to b. Imagine a spherical surface about the above-mentioned 
origin as center, enclosing the point P, and with a radius r. Consider 
that portion of the space to which the theorem relates which is 
within the sphere and in which u < b. The surface-integral of ∇u 
for this space is equal to zero in virtue of the general condition 
∇.∇u = 0. That part of the integral (if any) which relates to a 
portion of the spherical surface has a value numerically not greater 
than 4πr²(du/dr)′, where (du/dr)′ denotes the greatest numerical value 
of du/dr in the portion of the spherical surface considered. Hence, the 
value of this part of the surface-integral may be made less (numerically) 
than any assignable quantity by giving to r a sufficiently great 
value. Hence, the other part of the surface-integral (viz., that relating 
to the surface in which u = b, and to the boundary of the space to 
which the theorem relates) may be given a value differing from zero 
by less than any assignable quantity. But no part of the integral 
relating to this surface can be negative. Therefore no part can be 
positive, and the supposition relative to the point P is untenable. 

This proposition also may be generalized by substituting ∇.[t∇u] = 0 
for ∇.∇u = 0, and t r² du/dr = 0 for r² du/dr = 0. 

82. If throughout any continuous space (or in all space) 

∇t = ∇u,

then throughout the same space 

t = u + const. 

The truth of this and the three following theorems will be apparent if 
we consider the difference t − u. 

83. If throughout any continuous space (or in all space) 

∇.∇t = ∇.∇u,

and in any finite part of that space, or in any finite surface in or 
bounding it, 

∇t = ∇u,

then throughout the whole space 

∇t = ∇u, and t = u + const. 

84. If throughout a certain space (which need not be continuous, 
and which may extend to infinity) 

∇.∇t = ∇.∇u,

and in all the bounding surfaces 

t = u,

and at infinite distances within the space (if such there are) 

t = u,

then throughout the space 

t = u. 

85. If throughout a certain space (which need not be continuous, 
and which may extend to infinity) 

∇.∇t = ∇.∇u,

and in all the bounding surfaces the normal components of ∇t and ∇u 
are equal, and at infinite distances within the space (if such there are) 

r²(dt/dr − du/dr) = 0,

where r denotes the distance from some fixed origin, then throughout 
the space 

∇t = ∇u,

and in each continuous part of which the space consists 

t − u = constant. 

86. If throughout any continuous space (or in all space) 

∇×τ = ∇×ω and ∇.τ = ∇.ω,

and in any finite part of that space, or in any finite surface in or 
bounding it, 

τ = ω,

then throughout the whole space 

τ = ω. 

For, since ∇×(τ − ω) = 0, we may set ∇u = τ − ω, making the space 
acyclic (if necessary) by diaphragms. Then in the whole space u is 
single-valued and ∇.∇u = 0, and in a part of the space, or in a surface 
in or bounding it, ∇u = 0. Hence throughout the space ∇u = τ − ω = 0. 

87. If throughout an aperiphractic* space contained within finite 
boundaries but not necessarily continuous 

∇×τ = ∇×ω and ∇.τ = ∇.ω,

and in all the bounding surfaces the tangential components of τ and 
ω are equal, then throughout the space 

τ = ω. 

It is evidently sufficient to prove this proposition for a continuous 
space. Setting ∇u = τ − ω, we have ∇.∇u = 0 for the whole space, 
and u = constant for its boundary, which will be a single surface for a 
continuous aperiphractic space. Hence throughout the space 

∇u = τ − ω = 0. 

* If a space encloses within itself another space, it is called periphractic, otherwise 
aperiphractic. 

88. If throughout an acyclic space contained within finite boundaries 
but not necessarily continuous 

∇×τ = ∇×ω and ∇.τ = ∇.ω,

and in all the bounding surfaces the normal components of τ and ω 
are equal, then throughout the whole space 

τ = ω. 

Setting ∇u = τ − ω, we have ∇.∇u = 0 throughout the space, and 
the normal component of ∇u at the boundary equal to zero. Hence 
throughout the whole space ∇u = τ − ω = 0. 

89. If throughout a certain space (which need not be continuous, 
and which may extend to infinity) 

∇.∇τ = ∇.∇ω,

and in all the bounding surfaces 

τ = ω,

and at infinite distances within the space (if such there are) 

τ = ω,

then throughout the whole space 

τ = ω. 

This will be apparent if we consider separately each of the scalar 
components of τ and ω. 

Minimum Values of the Volume-integral ∫∫∫uω.ω dv. 
(Thomson's Theorems.) 

90. Let it be required to determine for a certain space a vector 
function of position w subject to certain conditions (to be specified 
hereafter), so that the volume-integral 

∫∫∫uω.ω dv 

for that space shall have a minimum value, u denoting a given 
positive scalar function of position. 

a. In the first place, let the vector ω be subject to the conditions 
that ∇.ω is given within the space, and that the normal component 
of ω is given for the bounding surface. (This component must of 
course be such that the surface-integral of ω shall be equal to the 
volume-integral ∫∫∫∇.ω dv. If the space is not continuous, this must 
be true of each continuous portion of it. See No. 57.) The solution 
is that ∇×(uω) = 0, or more generally, that the line-integral of uω for 
any closed curve in the space shall vanish. 





The existence of the minimum requires that 

∫∫∫uω.δω dv = 0,

while δω is subject to the limitation that 

∇.δω = 0,

and that the normal component of δω at the bounding surface vanishes. 
To prove that the line-integral of uω vanishes for any closed curve 
within the space, let us imagine the curve to be surrounded by an 
infinitely slender tube of normal section dz, which may be either 
constant or variable. We may satisfy the equation ∇.δω = 0 by 
making δω = 0 outside of the tube, and δω dz = δa dρ/ds within it, δa 
denoting an arbitrary infinitesimal constant, ρ the position-vector, and 
ds an element of the length of the tube or closed curve. We have then 

∫∫∫uω.δω dv = ∫uω.δω dz ds = ∫uω.dρ δa = δa∫uω.dρ = 0,

whence 

∫uω.dρ = 0. Q.E.D. 


We may express this result by saying that uω is the derivative of a 
single-valued scalar function of position in space. (See No. 67.) 

If for certain parts of the surface the normal component of ω is not 
given for each point, but only the surface-integral of ω for each such 
part, then the above reasoning will apply not only to closed curves, 
but also to curves commencing and ending in such a part of the 
surface. The primitive of uω will then have a constant value in 
each such part. 

If the space extends to infinity and there is no special condition 
respecting the value of ω at infinite distances, the primitive of uω 
will have a constant value at infinite distances within the space or 
within each separate continuous part of it. 

If we except those cases in which the problem has no definite 
meaning because the data are such that the integral ∫∫∫uω.ω dv must 
be infinite, it is evident that a minimum must always exist, and (on 
account of the quadratic form of the integral) that it is unique. That 
the conditions just found are sufficient to insure this minimum, is 
evident from the consideration that any allowable values of δω may 
be made up of such values as we have supposed. Therefore, there 
will be one and only one vector function of position in space which 
satisfies these conditions together with those enumerated at the 
beginning of this number. 

b. In the second place, let the vector ω be subject to the conditions 
that ∇×ω is given throughout the space, and that the tangential 
component of ω is given at the bounding surface. The solution is that 

∇.[uω] = 0,

and, if the space is periphractic, that the surface-integral of uω 
vanishes for each of the bounding surfaces. 

The existence of the minimum requires that 

∫∫∫uω.δω dv = 0,

while δω is subject to the conditions that 

∇×δω = 0,

and that the tangential component of δω in the bounding surface 
vanishes. In virtue of these conditions we may set 

δω = ∇δq,

where δq is an arbitrary infinitesimal scalar function of position, 
subject only to the condition that it is constant in each of the bounding 
surfaces. (See No. 67.) By substitution of this value we obtain 

∫∫∫uω.∇δq dv = 0,

or integrating by parts (No. 76) 

∫∫uω.dσ δq − ∫∫∫∇.[uω] δq dv = 0. 

Since δq is arbitrary in the volume-integral, we have throughout the 
whole space 

∇.[uω] = 0; 

and since δq has an arbitrary constant value in each of the bounding 
surfaces (if the boundary of the space consists of separate parts), we 
have for each such part 

∫∫uω.dσ = 0. 


Potentials, Newtonians, Laplacians. 


91. Def.—If u′ is the scalar quantity of something situated at a 
certain point ρ′, the potential of u′ for any point ρ is a scalar function 
of ρ, defined by the equation 

pot u′ = u′/[ρ′−ρ]₀,

and the Newtonian of u′ for any point ρ is a vector function of ρ 
defined by the equation 

new u′ = (ρ′−ρ)/[ρ′−ρ]₀³ u′. 

Again, if ω′ is the vector representing the quantity and direction of 
something situated at the point ρ′, the potential and the Laplacian of 
ω′ for any point ρ are vector functions of ρ defined by the equations 

pot ω′ = ω′/[ρ′−ρ]₀,

lap ω′ = (ρ′−ρ)/[ρ′−ρ]₀³ × ω′. 

92. If u or ω is a scalar or vector function of position in space, 
we may write Pot u, New u, Pot ω, Lap ω for the volume-integrals of 
pot u′, etc., taken as functions of ρ′; i.e., we may set 

Pot u = ∫∫∫pot u′ dv′ = ∫∫∫ u′/[ρ′−ρ]₀ dv′,

New u = ∫∫∫new u′ dv′ = ∫∫∫ (ρ′−ρ)/[ρ′−ρ]₀³ u′ dv′,

Pot ω = ∫∫∫pot ω′ dv′ = ∫∫∫ ω′/[ρ′−ρ]₀ dv′,

Lap ω = ∫∫∫lap ω′ dv′ = ∫∫∫ (ρ′−ρ)/[ρ′−ρ]₀³ × ω′ dv′,

where the ρ is to be regarded as constant in the integration. This 
extends over all space, or wherever the u′ or ω′ have any values 
other than zero. These integrals may themselves be called (integral) 
potentials, Newtonians, and Laplacians. 


93. 

d Pot u/dx = Pot du/dx, d Pot ω/dx = Pot dω/dx. 

This will be evident with respect both to scalar and to vector functions, 
if we suppose that when we differentiate the potential with respect 
to x (thus varying the position of the point for which the potential 
is taken) each element of volume dv′ in the implied integral remains 
fixed, not in absolute position, but in position relative to the point 
for which the potential is taken. This supposition is evidently allow¬ 
able whenever the integration indicated by the symbol Pot tends to a 
definite limit when the limits of integration are indefinitely extended. 

Since we may substitute y and z for x in the preceding formula, 
and since a constant factor of any kind may be introduced under the 
sign of integration, we have 

∇Pot u = Pot ∇u,

∇.Pot ω = Pot ∇.ω,

∇×Pot ω = Pot ∇×ω,

∇.∇Pot u = Pot ∇.∇u,

∇.∇Pot ω = Pot ∇.∇ω,

i.e., the symbols ∇, ∇., ∇×, ∇.∇ may be applied indifferently before 
or after the sign Pot. 

Yet a certain restriction is to be observed. When the operation of 
taking the (integral) potential does not give a definite finite value, 
the first members of these equations are to be regarded as entirely 
indeterminate, but the second members may have perfectly definite 
values. This would be the case, for example, if u or ω had a constant 
value throughout all space. It might seem harmless to set an 
indefinite expression equal to a definite, but it would be dangerous, since 
we might with equal right set the indefinite expression equal to other 
definite expressions, and then be misled into supposing these definite 
expressions to be equal to one another. It will be safe to say that 
the above equations will hold, provided that the potential of u or ω 
has a definite value. It will be observed that whenever Pot u or 
Pot ω has a definite value in general (i.e., with the possible exception 
of certain points, lines, and surfaces),* the first members of all these 
equations will have definite values in general, and therefore the 
second members of the equation, being necessarily equal to the first 
members, when these have definite values, will also have definite 
values in general. 

94. Again, whenever Pot u has a definite value we may write 

∇Pot u = ∇∫∫∫ u′/r dv′ = ∫∫∫ ∇(1/r) u′ dv′,

where r stands for [ρ′−ρ]₀. But 

∇(1/r) = (ρ′−ρ)/r³,

whence ∇Pot u = New u. 

Moreover, New u will in general have a definite value, if Pot u has. 
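The step on which No. 94 turns is the derivative of 1/r with respect to the field point ρ, the source point ρ′ being held fixed. The following sketch compares a central-difference gradient with the closed form (ρ′−ρ)/r³; the two sample points and the function names are arbitrary assumptions made for illustration.

```python
import math

def inv_distance(rho, rho_prime):
    """1/r, where r is the distance between the points rho and rho_prime."""
    r = math.sqrt(sum((b - a) ** 2 for a, b in zip(rho, rho_prime)))
    return 1.0 / r

def grad_wrt_rho(rho, rho_prime, h=1e-6):
    """Central-difference gradient of 1/r with respect to rho (rho_prime fixed)."""
    g = []
    for i in range(3):
        hi, lo = list(rho), list(rho)
        hi[i] += h
        lo[i] -= h
        g.append((inv_distance(hi, rho_prime) - inv_distance(lo, rho_prime)) / (2.0 * h))
    return g

rho = [0.2, -0.4, 0.9]         # hypothetical field point
rho_prime = [1.5, 0.3, -0.6]   # hypothetical source point
r = 1.0 / inv_distance(rho, rho_prime)

closed_form = [(b - a) / r ** 3 for a, b in zip(rho, rho_prime)]
numeric = grad_wrt_rho(rho, rho_prime)
print(closed_form, numeric)
```

Note the direction: the gradient points from ρ toward ρ′, which is why the Newtonian carries the factor (ρ′−ρ) rather than (ρ−ρ′).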

95. In like manner, whenever Pot ω has a definite value, 

∇×Pot ω = ∇×∫∫∫ ω′/r dv′ = ∫∫∫ ∇(1/r)×ω′ dv′. 

Substituting the value of ∇(1/r) given above we have 

∇×Pot ω = Lap ω. 

Lap ω will have a definite value in general whenever Pot ω has. 

96. Hence, with the aid of No. 93, we obtain 

∇×Lap ω = Lap ∇×ω,

∇.Lap ω = 0,

whenever Pot ω has a definite value. 

* Whenever it is said that a function of position in space has a definite value in 
general, this phrase is to be understood as explained above. The term definite is 
intended to exclude both indeterminate and infinite values. 

97. By the method of No. 93 we obtain 

∇.New u = ∇.∫∫∫ (ρ′−ρ)/r³ u′ dv′ = ∫∫∫ (ρ′−ρ)/r³ .∇u′ dv′. 

To find the value of this integral, we may regard the point ρ, which 
is constant in the integration, as the center of polar coordinates. 
Then r becomes the radius vector of the point ρ′, and we may set 

dv′ = r² dq dr,

where r² dq is the element of a spherical surface having center at ρ 
and radius r. We may also set 

(ρ′−ρ)/r .∇u′ = du′/dr. 

We thus obtain 

∇.New u = ∫∫∫ (du′/dr) dq dr = 4π∫ (dū′/dr) dr = 4πū′ (for r = ∞) − 4πū′ (for r = 0),

where ū′ denotes the average value of u′ in a spherical surface of 
radius r about the point ρ as center. 

Now if Pot u has in general a definite value, we must have ū′ = 0 
for r = ∞. Also, ∇.New u will have in general a definite value. For 
r = 0, the value of ū′ is evidently u. We have, therefore, 

∇.New u = −4πu,

∇.∇Pot u = −4πu.* 
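The result ∇.∇Pot u = −4πu can be tried on a case where the potential is known in closed form. The sketch below assumes a uniform value U0 of u inside a ball of radius R and zero outside (a choice made here for illustration, not one discussed in the text), for which Pot u = 2πU0(R² − r²/3) inside and (4/3)πR³U0/r outside. A finite-difference Laplacian should then give about −4πU0 inside and about zero outside.

```python
import math

U0 = 2.5   # hypothetical uniform value of u inside the ball
R = 1.0    # hypothetical radius of the ball

def pot(p):
    """Closed-form Pot u for a uniform ball, measured from its center."""
    r = math.sqrt(sum(c * c for c in p))
    if r <= R:
        return 2.0 * math.pi * U0 * (R * R - r * r / 3.0)
    return (4.0 / 3.0) * math.pi * R ** 3 * U0 / r

def laplacian(f, p, h=1e-3):
    """Seven-point finite-difference estimate of the Laplacian of f at p."""
    x, y, z = p
    neighbours = (
        f((x + h, y, z)) + f((x - h, y, z))
        + f((x, y + h, z)) + f((x, y - h, z))
        + f((x, y, z + h)) + f((x, y, z - h))
    )
    return (neighbours - 6.0 * f(p)) / (h * h)

inside = laplacian(pot, (0.3, 0.2, -0.1))   # expected near -4*pi*U0
outside = laplacian(pot, (1.5, 0.5, 0.2))   # expected near 0, since u = 0 there
print(inside, -4.0 * math.pi * U0, outside)
```

Both sample points are kept away from the surface r = R, where the second derivatives of the potential are discontinuous.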


98. If Pot ω has in general a definite value, 

∇.∇Pot ω = ∇.∇Pot [ui + vj + wk] 
= ∇.∇Pot ui + ∇.∇Pot vj + ∇.∇Pot wk 
= −4πui − 4πvj − 4πwk 
= −4πω. 

Hence, by No. 71, 

∇×∇×Pot ω − ∇∇.Pot ω = 4πω. 

That is, 

Lap ∇×ω − New ∇.ω = 4πω. 

If we set 

ω₁ = (1/4π) Lap ∇×ω, ω₂ = −(1/4π) New ∇.ω,

we have 

ω = ω₁ + ω₂,

where ω₁ and ω₂ are such functions of position that ∇.ω₁ = 0, and 
∇×ω₂ = 0. This is expressed by saying that ω₁ is solenoidal, and 
ω₂ irrotational. Pot ω₁ and Pot ω₂, like Pot ω, will have in general 
definite values. 

It is worth while to notice that there is only one way in which a 
vector function of position in space having a definite potential can 
be thus divided into solenoidal and irrotational parts having definite 
potentials. For if ω₁ + ε, ω₂ − ε are two other such parts, 

∇.ε = 0 and ∇×ε = 0. 

Moreover, Pot ε has in general a definite value, and therefore 

ε = (1/4π) Lap ∇×ε − (1/4π) New ∇.ε = 0. Q.E.D. 


* Better thus: ∇.∇Pot u = ∫∫∫(1/r)∇.∇u′ dv′ = ∫∫∫∇.[(1/r)∇u′]dv′ − ∫∫∫∇.[u′∇(1/r)]dv′ + ∫∫∫u′∇.∇(1/r) dv′ 
= −4πu. [MS. note by author.] 

99. To assist the memory of the student, some of the principal 
results of Nos. 93-98 may be expressed as follows: 

Let ω₁ be any solenoidal vector function of position in space, ω₂ any 
irrotational vector function, and u any scalar function, satisfying the 
conditions that their potentials have in general definite values. 

With respect to the solenoidal function ω₁, (1/4π) Lap and ∇× are 
inverse operators; i.e., 

(1/4π) Lap ∇×ω₁ = ∇×(1/4π) Lap ω₁ = ω₁. 

Applied to the irrotational function ω₂, either of these operators gives 
zero; i.e., 

Lap ω₂ = 0, ∇×ω₂ = 0. 

With respect to the irrotational function ω₂, or the scalar function u, 
(1/4π) New and −∇. are inverse operators; i.e., 

−(1/4π) New ∇.ω₂ = ω₂, −∇.(1/4π) New u = u. 

Applied to the solenoidal function ω₁, the operator ∇. gives zero; i.e., 

∇.ω₁ = 0. 

Since the most general form of a vector function having in general 
a definite potential may be written ω₁ + ω₂, the effect of these operators 
on such a function needs no especial mention. 

With respect to the solenoidal function ω₁, (1/4π) Pot and ∇×∇× are 
inverse operators; i.e., 

(1/4π) Pot ∇×∇×ω₁ = ∇×(1/4π) Pot ∇×ω₁ = ∇×∇×(1/4π) Pot ω₁ = ω₁. 

With respect to the irrotational function ω₂, (1/4π) Pot and −∇∇. are 
inverse operators; i.e., 

−(1/4π) Pot ∇∇.ω₂ = −∇(1/4π) Pot ∇.ω₂ = −∇∇.(1/4π) Pot ω₂ = ω₂. 

With respect to any scalar or vector function having in general a 
definite potential (1/4π) Pot and −∇.∇ are inverse operators; i.e., 

−(1/4π) Pot ∇.∇u = −∇.(1/4π) Pot ∇u = −∇.∇(1/4π) Pot u = u,

−(1/4π) Pot ∇.∇[ω₁ + ω₂] = −∇.∇(1/4π) Pot [ω₁ + ω₂] = ω₁ + ω₂. 

With respect to the solenoidal function ω₁, −∇.∇ and ∇×∇× are 
equivalent; with respect to the irrotational function ω₂, ∇.∇ and ∇∇. 
are equivalent; i.e., 

−∇.∇ω₁ = ∇×∇×ω₁, ∇.∇ω₂ = ∇∇.ω₂. 

100. On the interpretation of the preceding formulae.—Infinite 
values of the quantity which occurs in a volume-integral as the 
coefficient of the element of volume will not necessarily make the 
value of the integral infinite, when they are confined to certain 
surfaces, lines, or points. Yet these surfaces, lines, or points may 
contribute a certain finite amount to the value of the volume-integral, 
which must be separately calculated, and in the case of surfaces or 
lines is naturally expressed as a surface- or line-integral. Such cases 
are easily treated by substituting for the surface, line, or point, a 
very thin shell, or filament, or a solid very small in all dimensions, 
within which the function may be supposed to have a very large 
value. 

The only cases which we shall here consider in detail are those of 
surfaces at which the functions of position (u or w) are discontinuous, 
and the values of ∇u, ∇×ω, ∇.ω thus become infinite. Let the 
function u have the value u₁ on the side of the surface which we 
regard as the negative, and the value u₂ on the positive side. Let 
Δu = u₂ − u₁. If we substitute for the surface a shell of very small 
thickness a, within which the value of u varies uniformly as we pass 
through the shell, we shall have ∇u = ν Δu/a within the shell, ν denoting 
a unit normal on the positive side of the surface. The elements of 
volume which compose the shell may be expressed by a[dσ]₀, where 
[dσ]₀ is the magnitude of an element of the surface, dσ being the 
vector element. Hence, 

∇u dv = ν Δu [dσ]₀ = Δu dσ. 

Hence, when there are surfaces at which the values of u are 
discontinuous, the full value of Pot ∇u should always be understood 
as including the surface-integral 

∫∫ Δu′ dσ′/[ρ′−ρ]₀ 

relating to such surfaces. (Δu′ and dσ′ are accented in the formula 
to indicate that they relate to the point ρ′.) 

In the case of a vector function which is discontinuous at a surface, 
the expressions ∇.ω dv and ∇×ω dv, relating to the element of the 
shell which we substitute for the surface of discontinuity, are easily 
transformed by the principle that these expressions are the direct 
and skew surface-integrals of ω for the element of the shell. (See 
Nos. 56, 66.) The part of the surface-integrals relating to the edge 
of the element may evidently be neglected, and we shall have 

∇.ω dv = ω₂.dσ − ω₁.dσ = Δω.dσ,

∇×ω dv = dσ×ω₂ − dσ×ω₁ = dσ×Δω. 

Whenever, therefore, ω is discontinuous at surfaces, the expressions 
Pot ∇.ω and New ∇.ω must be regarded as implicitly including the 
surface-integrals 

∫∫ Δω′.dσ′/[ρ′−ρ]₀ and ∫∫ (ρ′−ρ)/[ρ′−ρ]₀³ Δω′.dσ′ 

respectively, relating to such surfaces, and the expressions Pot ∇×ω 
and Lap ∇×ω as including the surface-integrals 

∫∫ dσ′×Δω′/[ρ′−ρ]₀ and ∫∫ (ρ′−ρ)/[ρ′−ρ]₀³ × [dσ′×Δω′] 

respectively, relating to such surfaces. 

101. We have already seen that if ω is the curl of any vector 
function of position, ∇.ω = 0. (No. 68.) The converse is evidently 
true, whenever the equation ∇.ω = 0 holds throughout all space, and 
ω has in general a definite potential; for then 

ω = ∇×(1/4π) Lap ω. 

Again, if ∇.ω = 0 within any aperiphractic space A, contained 
within finite boundaries, we may suppose that space to be enclosed by 
a shell B having its inner surface coincident with the surface of A. 
We may imagine a function of position ω′, such that ω′ = ω in A, ω′ = 0 
outside of the shell B, and the integral ∫∫∫ω′.ω′ dv for B has the least 
value consistent with the conditions that the normal component of ω′ 
at the outer surface is zero, and at the inner surface is equal to that 
of ω, and that in the shell ∇.ω′ = 0 (compare No. 90). Then ∇.ω′ = 0 
throughout all space, and the potential of ω′ will have in general a 
definite value. Hence, 

ω′ = ∇×(1/4π) Lap ω′,

and ω will have the same value within the space A. 

†102. Def.—If ω is a vector function of position in space, the 
Maxwellian* of ω is a scalar function of position defined by the equation 

Max ω = ∫∫∫ (ρ′−ρ)/[ρ′−ρ]₀³ .ω′ dv′. 

(Compare No. 92.) From this definition the following properties are 
easily derived. It is supposed that the functions ω and u are such 
that their potentials have in general definite values. 

Max ω = ∇.Pot ω = Pot ∇.ω,

∇Max ω = ∇∇.Pot ω = New ∇.ω,

Max ∇u = −4πu,

4πω = ∇×Lap ω − ∇Max ω. 


If the values of Lap Lap ω, New Max ω, and Max New u are in general 
definite, we may add 

4π Pot ω = Lap Lap ω − New Max ω,

4π Pot u = −Max New u. 

In other words: The Maxwellian is the divergence of the potential, 
−(1/4π) Max and ∇ are inverse operators for scalars and irrotational 
vectors, for vectors in general −(1/4π) ∇Max is an operator which 
separates the irrotational from the solenoidal part. For scalars and 
irrotational vectors, −(1/4π) Max New and −(1/4π) New Max give the 
potential, for solenoidal vectors (1/4π) Lap Lap gives the potential, for 
vectors in general −(1/4π) New Max gives the potential of the 
irrotational part, and (1/4π) Lap Lap the potential of the solenoidal part. 

* The frequent occurrence of the integral in Maxwell's Treatise on Electricity and 
Magnetism has suggested this name. 

†[The foregoing portion of this paper was printed in 1881, the rest in 1884.] 

103. Def.—The following double volume-integrals are of frequent 
occurrence in physical problems. They are all scalar quantities, and 
none of them functions of position in space, as are the single 
volume-integrals which we have been considering. The integrations extend 
over all space, or as far as the expression to be integrated has values 
other than zero. 

The mutual potential, or potential product, of two scalar functions 
of position in space is defined by the equation 

Pot (u, w) = ∫∫∫∫∫∫ (u w′/r) dv dv′ = ∫∫∫u Pot w dv = ∫∫∫w Pot u dv. 

In the double volume-integral, r is the distance between the two 
elements of volume, and u relates to dv as w′ to dv′. 

The mutual potential, or potential product, of two vector functions 
of position in space is defined by the equation 

Pot (φ, ω) = ∫∫∫∫∫∫ (φ.ω′/r) dv dv′ = ∫∫∫φ.Pot ω dv = ∫∫∫ω.Pot φ dv. 

The mutual Laplacian, or Laplacian product, of two vector 
functions of position in space is defined by the equation 

Lap (φ, ω) = ∫∫∫∫∫∫ φ.(ρ′−ρ)/r³ × ω′ dv dv′ 
= ∫∫∫φ.Lap ω dv = ∫∫∫ω.Lap φ dv. 

The Newtonian product of a scalar and a vector function of position 
in space is defined by the equation 

New (u, ω) = ∫∫∫∫∫∫ ω.(ρ′−ρ)/r³ u′ dv dv′ = ∫∫∫ω.New u dv. 





The Maxwellian product of a vector and a scalar function of 
position in space is defined by the equation 

Max (ω, u) = ∫∫∫∫∫∫ u (ρ′−ρ)/r³ .ω′ dv dv′ = ∫∫∫u Max ω dv = −New (u, ω). 

It is of course supposed that u, w, φ, ω are such functions of position 
that the above expressions have definite values. 

104. By No. 97, 

4πu Pot w = −∇.New u Pot w = −∇.[New u Pot w] + New u.New w. 

The volume-integral of this equation gives 

4π Pot (u, w) = ∫∫∫New u.New w dv,

if the integral 

∫∫dσ.New u Pot w,

for a closed surface, vanishes when the space included by the surface 
is indefinitely extended in all directions. This will be the case when 
everywhere outside of certain assignable limits the values of u and w 
are zero. 

Again, by No. 102, 

4πω.Pot φ = ∇×Lap ω.Pot φ − ∇Max ω.Pot φ 
= ∇.[Lap ω×Pot φ] + Lap ω.Lap φ 
− ∇.[Max ω Pot φ] + Max ω Max φ. 

The volume-integral of this equation gives 

4π Pot (φ, ω) = ∫∫∫Lap φ.Lap ω dv + ∫∫∫Max φ Max ω dv,

if the integrals 

∫∫dσ.Lap ω×Pot φ, ∫∫dσ.Pot φ Max ω,

for a closed surface vanish when the space included by the surface is 
indefinitely extended in all directions. This will be the case if 
everywhere outside of certain assignable limits the values of φ and ω are 
zero. 


CHAPTER III. 

CONCERNING LINEAR VECTOR FUNCTIONS. 

105. Def. —A vector function of a vector is said to be linear, when 
the function of the sum of any two vectors is equal to the sum of the 
functions of the vectors. That is, if 

func. [ρ + ρ′] = func. [ρ] + func. [ρ′] 

for all values of ρ and ρ′, the function is linear. In such cases it is 
easily shown that 

func. [aρ + bρ′ + cρ″ + etc.] = a func. [ρ] + b func. [ρ′] + c func. [ρ″] + etc. 

106. An expression of the form 

αλ.ρ + βμ.ρ + etc. 

evidently represents a linear function of ρ, and may be conveniently 
written in the form 

{αλ + βμ + etc.}.ρ. 

The expression ρ.αλ + ρ.βμ + etc., 

or ρ.{αλ + βμ + etc.}, 

also represents a linear function of ρ, which is, in general, different 
from the preceding, and will be called its conjugate. 

107. Def.—An expression of the form αλ or βμ will be called a 
dyad. An expression consisting of any number of dyads united by 
the signs + or − will be called a dyadic binomial, trinomial, etc., 
as the case may be, or more briefly, a dyadic. The latter term will 
be used so as to include the case of a single dyad. When we desire 
to express a dyadic by a single letter, the Greek capitals will be used, 
except such as are like the Roman, and also Δ and Σ. The letter I 
will also be used to represent a certain dyadic, to be mentioned 
hereafter. 

Since any linear vector function may be expressed by means of a 
dyadic (as we shall see more particularly hereafter, see No. 110), the 
study of such functions, which is evidently of primary importance in 
the theory of vectors, may be reduced to that of dyadics. 

108. Def.—Any two dyadics Φ and Ψ are equal, 

when Φ.ρ = Ψ.ρ for all values of ρ, 
or, when ρ.Φ = ρ.Ψ for all values of ρ, 
or, when σ.Φ.ρ = σ.Ψ.ρ for all values of σ and of ρ. 

The third condition is easily shown to be equivalent both to the first 
and to the second. The three conditions are therefore equivalent. 

It follows that Φ = Ψ, if Φ.ρ = Ψ.ρ, or ρ.Φ = ρ.Ψ, for three 
non-complanar values of ρ. 

109. Def.—We shall call the vector Φ.ρ the (direct) product of 
Φ and ρ, the vector ρ.Φ the (direct) product of ρ and Φ, and the 
scalar σ.Φ.ρ the (direct) product of σ, Φ, and ρ. 

In the combination Φ.ρ, we shall say that Φ is used as a prefactor, 
in the combination ρ.Φ, as a postfactor. 

110. If τ is any linear function of ρ, and for ρ = i, ρ = j, ρ = k, the 
values of τ are respectively α, β, and γ, we may set 

τ = {αi + βj + γk}.ρ,

and also 

τ = ρ.{iα + jβ + kγ}. 

Therefore, any linear function may be expressed by a dyadic as 
prefactor and also by a dyadic as postfactor. 
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111. De/—We shall say that a dyadic is multiplied by a scalar, 
when one of the vectors of each of its component dyads is multiplied 
by that scalar. It is evidently immaterial to which vector of any 
dyad the scalar factor is applied. The product of the dyadic $ and 
the scalar a may be written either a# or ^a. The minus sign before 
a dyadic reverses the signs of all its terma 

112. The sign $+$ in a dyadic, or connecting dyadics, may be
regarded as expressing addition, since the combination of dyads and
dyadics with this sign is subject to the laws of association and
commutation.

113. The combination of vectors in a dyad is evidently distributive.
That is,

$$[\alpha + \beta + \text{etc.}]\,[\lambda + \mu + \text{etc.}] = \alpha\lambda + \alpha\mu + \beta\lambda + \beta\mu + \text{etc.}$$

We may therefore regard the dyad as a kind of product of the two
vectors of which it is formed. Since this kind of product is not
commutative, we shall have occasion to distinguish the factors as
antecedent and consequent.

114. Since any vector may be expressed as a sum of $i$, $j$, and $k$ with
scalar coefficients, every dyadic may be reduced to a sum of the nine
dyads

$$ii,\ ij,\ ik,\ ji,\ jj,\ jk,\ ki,\ kj,\ kk,$$


with scalar coefficients. Two such sums cannot be equal according to 
the definitions of No. 108, unless their coefficients are equal each to 
each. Hence dyadics are equal only when their equality can be 
deduced from the principle that the operation of forming a dyad is a 
distributive one. 

On this account, we may regard the dyad as the most general form 
of product of two vectors. We shall call it the indeterminate product. 
The complete determination of a single dyad involves five independent 
scalars, of a dyadic, nine. 

115. It follows from the principles of the last paragraph that if

$$\sum \alpha\beta = \sum \kappa\lambda,$$

then

$$\sum \alpha\times\beta = \sum \kappa\times\lambda,$$

and

$$\sum \alpha\cdot\beta = \sum \kappa\cdot\lambda.$$

In other words, the vector and the scalar obtained from a dyadic
by insertion of the sign of skew or direct multiplication in each dyad
are both independent of the particular form in which the dyadic is
expressed.

We shall write $\Phi_\times$ and $\Phi_S$ to indicate the vector and the scalar thus
obtained.

$$\Phi_\times = (j\cdot\Phi\cdot k - k\cdot\Phi\cdot j)\,i + (k\cdot\Phi\cdot i - i\cdot\Phi\cdot k)\,j + (i\cdot\Phi\cdot j - j\cdot\Phi\cdot i)\,k,$$

$$\Phi_S = i\cdot\Phi\cdot i + j\cdot\Phi\cdot j + k\cdot\Phi\cdot k,$$

as is at once evident, if we suppose $\Phi$ to be expanded in terms of
$ii$, $ij$, etc.
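
The independence asserted in No. 115 can be checked numerically. The sketch below assumes the matrix model of a dyadic (entry m, n is the coefficient of the m-th unit vector followed by the n-th); the function names `vector_of` and `scalar_of` and the sample dyads are hypothetical. It compares the vector and scalar computed dyad by dyad with those read from the nine coefficients.

```python
# Illustrative sketch: the vector and the scalar of a dyadic, computed two ways.

def dot(u, v):
    return sum(a * b for a, b in zip(u, v))

def cross(u, v):
    return [u[1] * v[2] - u[2] * v[1],
            u[2] * v[0] - u[0] * v[2],
            u[0] * v[1] - u[1] * v[0]]

def dyad(a, b):
    return [[x * y for y in b] for x in a]

def add(*dyadics):
    return [[sum(d[m][n] for d in dyadics) for n in range(3)] for m in range(3)]

def vector_of(phi):
    # (j.Phi.k - k.Phi.j) i + (k.Phi.i - i.Phi.k) j + (i.Phi.j - j.Phi.i) k
    return [phi[1][2] - phi[2][1], phi[2][0] - phi[0][2], phi[0][1] - phi[1][0]]

def scalar_of(phi):
    # i.Phi.i + j.Phi.j + k.Phi.k
    return phi[0][0] + phi[1][1] + phi[2][2]

# A dyadic given as a sum of three dyads (arbitrary sample values).
pairs = [([1, 2, 0], [0, 1, 3]), ([2, 0, 1], [1, 1, 0]), ([0, 1, 1], [3, 0, -1])]
phi = add(*[dyad(a, b) for a, b in pairs])

# Insert the sign of skew or direct multiplication in each dyad and sum.
vec_from_dyads = [sum(cross(a, b)[n] for a, b in pairs) for n in range(3)]
scal_from_dyads = sum(dot(a, b) for a, b in pairs)

print(vector_of(phi), vec_from_dyads)
print(scalar_of(phi), scal_from_dyads)
```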

116. Def. The (direct) product of two dyads (indicated by a dot)
is the dyad formed of the first and last of the four factors, multiplied
by the direct product of the second and third. That is,

$$\{\alpha\beta\}\cdot\{\gamma\delta\} = \alpha\,\beta\cdot\gamma\,\delta = \beta\cdot\gamma\;\alpha\delta.$$

The (direct) product of two dyadics is the sum of all the products
formed by prefixing a term of the first dyadic to a term of the second.
Since the direct product of one dyadic with another is a dyadic, it
may be multiplied in the same way by a third, and so on indefinitely.
This kind of multiplication is evidently associative, as well as
distributive. The same is true of the direct product of a series of factors
of which the first and the last are either dyadics or vectors, and the
other factors are dyadics. Thus the values of the expressions

$$\alpha\cdot\Phi\cdot\Theta\cdot\Psi\cdot\beta, \qquad \alpha\cdot\Phi\cdot\Theta, \qquad \Phi\cdot\Theta\cdot\Psi$$

will not be affected by any insertion of parentheses. But this kind of
multiplication is not commutative, except in the case of the direct
product of two vectors.

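117. Def. The expressions $\Phi\times\rho$ and $\rho\times\Phi$ represent dyadics which
we shall call the skew products of $\Phi$ and $\rho$. If

$$\Phi = \alpha\lambda + \beta\mu + \text{etc.},$$

these skew products are defined by the equations

$$\Phi\times\rho = \alpha\,\lambda\times\rho + \beta\,\mu\times\rho + \text{etc.},$$

$$\rho\times\Phi = \rho\times\alpha\,\lambda + \rho\times\beta\,\mu + \text{etc.}$$

It is evident that

$$\{\rho\times\Phi\}\cdot\Psi = \rho\times\{\Phi\cdot\Psi\}, \qquad \Psi\cdot\{\Phi\times\rho\} = \{\Psi\cdot\Phi\}\times\rho,$$

$$\{\rho\times\Phi\}\cdot\alpha = \rho\times[\Phi\cdot\alpha], \qquad \alpha\cdot\{\Phi\times\rho\} = [\alpha\cdot\Phi]\times\rho,$$

$$\{\rho\times\Phi\}\times\alpha = \rho\times\{\Phi\times\alpha\}.$$

We may therefore write without ambiguity

$$\rho\times\Phi\cdot\Psi, \quad \Psi\cdot\Phi\times\rho, \quad \rho\times\Phi\cdot\alpha, \quad \alpha\cdot\Phi\times\rho, \quad \rho\times\Phi\times\alpha.$$

This may be expressed a little more generally by saying that the
associative principle enunciated in No. 116 may be extended to cases
in which the initial or final vectors are connected with the other
factors by the sign of skew multiplication.

Moreover,

$$\alpha\cdot\rho\times\Phi = [\alpha\times\rho]\cdot\Phi \quad\text{and}\quad \Phi\times\rho\cdot\alpha = \Phi\cdot[\rho\times\alpha].$$

These expressions evidently represent vectors. So

$$\Psi\cdot\{\rho\times\Phi\} = \{\Psi\times\rho\}\cdot\Phi.$$

These expressions represent dyadics. The braces cannot be omitted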
without ambiguity.

118. Since all the antecedents or all the consequents in any dyadic
may be expressed in parts of any three non-complanar vectors, and
since the sum of any number of dyads having the same antecedent
or the same consequent may be expressed by a single dyad, it follows
that any dyadic may be expressed as the sum of three dyads, and so,
that either the antecedents or the consequents shall be any desired
non-complanar vectors, but only in one way when either the
antecedents or the consequents are thus given.

In particular, the dyadic

$$\begin{aligned} & a\,ii + b\,ij + c\,ik \\ +\; & a'\,ji + b'\,jj + c'\,jk \\ +\; & a''\,ki + b''\,kj + c''\,kk, \end{aligned}$$

which may for brevity be written

$$\left\{\begin{matrix} a & b & c \\ a' & b' & c' \\ a'' & b'' & c'' \end{matrix}\right\},$$

is equal to

$$\alpha i + \beta j + \gamma k,$$

where

$$\alpha = a\,i + a'\,j + a''\,k, \qquad \beta = b\,i + b'\,j + b''\,k, \qquad \gamma = c\,i + c'\,j + c''\,k,$$

and to

$$i\lambda + j\mu + k\nu,$$

where

$$\lambda = a\,i + b\,j + c\,k, \qquad \mu = a'\,i + b'\,j + c'\,k, \qquad \nu = a''\,i + b''\,j + c''\,k.$$


119. By a similar process, the sum of three dyads may be reduced
to the sum of two dyads, whenever either the antecedents or the
consequents are complanar, and only in such cases. To prove the
latter point, let us suppose that in the dyadic

$$\alpha\lambda + \beta\mu + \gamma\nu$$

neither the antecedents nor the consequents are complanar. The
vector

$$\{\alpha\lambda + \beta\mu + \gamma\nu\}\cdot\rho$$

is a linear function of $\rho$ which will be parallel to $\alpha$ when $\rho$ is
perpendicular to $\mu$ and $\nu$, which will be parallel to $\beta$ when $\rho$ is perpendicular
to $\nu$ and $\lambda$, and which will be parallel to $\gamma$ when $\rho$ is perpendicular to
$\lambda$ and $\mu$. Hence, the function may be given any value whatever by
giving the proper value to $\rho$. This would evidently not be the case
with the sum of two dyads. Hence, by No. 108, this dyadic cannot be
equal to the sum of two dyads.

120. In like manner, the sum of two dyads may be reduced to a
single dyad, if either the antecedents or the consequents are parallel,
and only in such cases.

A sum of three dyads cannot be reduced to a single dyad, unless
either their antecedents or consequents are parallel, or both
antecedents and consequents are (separately) complanar. In the first case
the reduction can always be made, in the second, occasionally.

121. Def. A dyadic which cannot be reduced to the sum of less
than three dyads will be called complete.

A dyadic which can be reduced to the sum of two dyads will be
called planar. When the plane of the antecedents coincides with
that of the consequents, the dyadic will be called uniplanar. These
planes are invariable for a given dyadic, although the dyadic may be
so expressed that either the two antecedents or the two consequents
may have any desired values (which are not parallel) within their
planes.

A dyadic which can be reduced to a single dyad will be called 
linear. When the antecedent and consequent are parallel, it will be 
called unilinear. 

A dyadic is said to have the value zero when all its terms vanish. 


122. If we set

$$\sigma = \Phi\cdot\rho, \qquad \tau = \rho\cdot\Phi,$$

and give $\rho$ all possible values, $\sigma$ and $\tau$ will receive all possible values,
if $\Phi$ is complete. The values of $\sigma$ and $\tau$ will be confined each to a
plane if $\Phi$ is planar, which planes will coincide if $\Phi$ is uniplanar.
The values of $\sigma$ and $\tau$ will be confined each to a line if $\Phi$ is linear,
which lines will coincide if $\Phi$ is unilinear.

123. The products of complete dyadics are complete, of complete 
and planar dyadics are planar, of complete and linear dyadics are 
linear. 

The products of planar dyadics are planar, except that when the 
plane of the consequents of the first dyadic is perpendicular to the 
plane of the antecedents of the second dyadic, the product reduces 
to a linear dyadic. 

The products of linear dyadics are linear, except that when the 
consequent of the first is perpendicular to the antecedent of the 
second, the product reduces to zero. 

The products of planar and linear dyadics are linear, except when, 
the planar preceding, the plane of its consequents is perpendicular to 
the antecedent of the linear, or, the linear preceding, its consequent is 
perpendicular to the plane of the antecedents of the planar. In these 
cases the product is zero. 

All these cases are readily proved, if we set

$$\sigma = \Phi\cdot\Psi\cdot\rho,$$

and consider the limits within which $\sigma$ varies, when we give $\rho$ all
possible values.

The products $\Phi\times\rho$ and $\rho\times\Phi$ are evidently planar dyadics.

124. Def. A dyadic $\Phi$ is said to be an idemfactor, when

$\Phi\cdot\rho = \rho$ for all values of $\rho$,
or when $\rho\cdot\Phi = \rho$ for all values of $\rho$.

If either of these conditions holds true, $\Phi$ must be reducible to the
form

$$ii + jj + kk.$$

Therefore, both conditions will hold, if either does. All such dyadics
are equal, by No. 108. They will be represented by the letter $I$.

The direct product of an idemfactor with another dyadic is equal to
that dyadic. That is,

$$I\cdot\Phi = \Phi, \qquad \Phi\cdot I = \Phi,$$

where $\Phi$ is any dyadic.

A dyadic of the form

$$\alpha\alpha' + \beta\beta' + \gamma\gamma',$$

in which $\alpha'$, $\beta'$, $\gamma'$ are the reciprocals of $\alpha$, $\beta$, $\gamma$, is an idemfactor.
(See No. 38.) A dyadic trinomial cannot be an idemfactor, unless its
antecedents and consequents are reciprocals.

125. If one of the direct products of two dyadics is an idemfactor
the other is also. For, if $\Phi\cdot\Psi = I$,

$$\sigma\cdot\Phi\cdot\Psi = \sigma$$

for all values of $\sigma$, and $\Phi$ is complete;

$$\sigma\cdot\Phi\cdot\Psi\cdot\Phi = \sigma\cdot\Phi$$

for all values of $\sigma$, therefore for all values of $\sigma\cdot\Phi$, and therefore

$$\Psi\cdot\Phi = I.$$

Def. In this case, either dyadic is called the reciprocal of the
other.

It is evident that an incomplete dyadic cannot have any (finite)
reciprocal.

Reciprocals of the same dyadic are equal. For if $\Phi$ and $\Psi$ are both
reciprocals of $\Omega$,

$$\Phi = \Phi\cdot\Omega\cdot\Psi = \Psi.$$

If two dyadics are reciprocals, the operators formed by using these
dyadics as prefactors are inverse, also the operators formed by using
them as postfactors.

126. The reciprocal of any complete dyadic

$$\alpha\lambda + \beta\mu + \gamma\nu$$

is

$$\lambda'\alpha' + \mu'\beta' + \nu'\gamma',$$

where $\alpha'$, $\beta'$, $\gamma'$ are the reciprocals of $\alpha$, $\beta$, $\gamma$, and $\lambda'$, $\mu'$, $\nu'$ are the
reciprocals of $\lambda$, $\mu$, $\nu$. (See No. 38.)
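
The rule of No. 126 can be verified on a small example. The sketch below assumes the matrix model of a dyadic and the usual reciprocal system of three non-complanar vectors (the first reciprocal being the skew product of the second and third vectors divided by the scalar triple product, and so on cyclically); the helper names and sample vectors are hypothetical. Exact rational arithmetic is used so that the products come out exactly equal to the idemfactor.

```python
from fractions import Fraction

# Illustrative sketch: reciprocal of a complete dyadic from reciprocal vectors.

def dot(u, v):
    return sum(a * b for a, b in zip(u, v))

def cross(u, v):
    return [u[1] * v[2] - u[2] * v[1],
            u[2] * v[0] - u[0] * v[2],
            u[0] * v[1] - u[1] * v[0]]

def dyad(a, b):
    return [[x * y for y in b] for x in a]

def add(*dyadics):
    return [[sum(d[m][n] for d in dyadics) for n in range(3)] for m in range(3)]

def mul(p, q):
    # Direct product of two dyadics
    return [[sum(p[m][k] * q[k][n] for k in range(3)) for n in range(3)]
            for m in range(3)]

def reciprocals(a, b, c):
    # Reciprocal system of three non-complanar vectors
    vol = Fraction(dot(a, cross(b, c)))
    return ([x / vol for x in cross(b, c)],
            [x / vol for x in cross(c, a)],
            [x / vol for x in cross(a, b)])

alpha, beta, gamma = [1, 0, 1], [0, 2, 1], [1, 1, 0]
lam, mu, nu = [1, 1, 0], [0, 1, 1], [2, 0, 1]

ra, rb, rg = reciprocals(alpha, beta, gamma)
rl, rm, rn = reciprocals(lam, mu, nu)

phi = add(dyad(alpha, lam), dyad(beta, mu), dyad(gamma, nu))
phi_inv = add(dyad(rl, ra), dyad(rm, rb), dyad(rn, rg))

identity = [[1, 0, 0], [0, 1, 0], [0, 0, 1]]
print(mul(phi, phi_inv) == identity, mul(phi_inv, phi) == identity)
```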
127. Def. We shall write $\Phi^{-1}$ for the reciprocal of any (complete)
dyadic $\Phi$, also $\Phi^{2}$ for $\Phi\cdot\Phi$, etc., and $\Phi^{-2}$ for $\Phi^{-1}\cdot\Phi^{-1}$, etc. It is
evident that $\Phi^{-n}$ is the reciprocal of $\Phi^{n}$.

128. In the reduction of equations, if we have

$$\Phi\cdot\Psi = \Phi\cdot\Omega,$$

we may cancel the $\Phi$ (which is equivalent to multiplying by $\Phi^{-1}$) if
$\Phi$ is a complete dyadic, but not otherwise. The case is the same with
such equations as

$$\Phi\cdot\sigma = \Phi\cdot\rho, \qquad \Psi\cdot\Phi = \Omega\cdot\Phi, \qquad \sigma\cdot\Phi = \rho\cdot\Phi.$$

To cancel an incomplete dyadic in such cases would be analogous to
cancelling a zero factor in algebra.

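129. Def. If in any dyadic we transpose the factors in each term,
the dyadic thus formed is said to be conjugate to the first. Thus

$$\alpha\lambda + \beta\mu + \gamma\nu \quad\text{and}\quad \lambda\alpha + \mu\beta + \nu\gamma$$

are conjugate to each other. A dyadic of which the value is not
altered by such transposition is said to be self-conjugate. The
conjugate of any dyadic $\Phi$ may be written $\Phi_C$. It is evident that

$$\rho\cdot\Phi = \Phi_C\cdot\rho, \qquad \Phi\cdot\rho = \rho\cdot\Phi_C.$$

$\Phi_C\cdot\rho$ and $\Phi\cdot\rho$ are conjugate functions of $\rho$. (See No. 106.) Since
$\{\Phi_C\}^{2} = \{\Phi^{2}\}_C$, we may write $\Phi_C^{2}$, etc., without ambiguity.

130. The reciprocal of the product of any number of dyadics is
equal to the product of their reciprocals taken in inverse order. Thus

$$\{\Phi\cdot\Psi\cdot\Omega\}^{-1} = \Omega^{-1}\cdot\Psi^{-1}\cdot\Phi^{-1}.$$

The conjugate of the product of any number of dyadics is equal to
the product of their conjugates taken in inverse order. Thus

$$\{\Phi\cdot\Psi\cdot\Omega\}_C = \Omega_C\cdot\Psi_C\cdot\Phi_C.$$

Hence, since

$$\Phi_C\cdot\{\Phi^{-1}\}_C = \{\Phi^{-1}\cdot\Phi\}_C = I,$$

$$\{\Phi^{-1}\}_C = \{\Phi_C\}^{-1},$$

and we may write $\Phi_C^{-1}$ without ambiguity.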

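131. It is sometimes convenient to be able to express by a dyadic
taken in direct multiplication the same operation which would be
effected by a given vector ($\alpha$) in skew multiplication. The dyadic
$I\times\alpha$ will answer this purpose. For, by No. 117,

$$\{I\times\alpha\}\cdot\rho = \alpha\times\rho, \qquad \rho\cdot\{I\times\alpha\} = \rho\times\alpha,$$

$$\{I\times\alpha\}\cdot\Phi = \alpha\times\Phi, \qquad \Phi\cdot\{I\times\alpha\} = \Phi\times\alpha.$$

The same is true of the dyadic $\alpha\times I$, which is indeed identical with
$I\times\alpha$, as appears from the equation $I\cdot\{\alpha\times I\} = \{I\times\alpha\}\cdot I$.

If $\alpha$ is a unit vector,

$$\{I\times\alpha\}^{2} = -\{I - \alpha\alpha\},$$

$$\{I\times\alpha\}^{3} = -I\times\alpha,$$

$$\{I\times\alpha\}^{4} = I - \alpha\alpha,$$

$$\{I\times\alpha\}^{5} = I\times\alpha,$$

etc.

If $i$, $j$, $k$ are a normal system of unit vectors

$$I\times i = i\times I = kj - jk,$$

$$I\times j = j\times I = ik - ki,$$

$$I\times k = k\times I = ji - ij.$$

If $\alpha$ and $\beta$ are any vectors,

$$[\alpha\times\beta]\times I = I\times[\alpha\times\beta] = \beta\alpha - \alpha\beta.$$

That is, the vector $\alpha\times\beta$ as a pre- or post-factor in skew
multiplication is equivalent to the dyadic $\{\beta\alpha - \alpha\beta\}$ taken as pre- or
post-factor in direct multiplication.

$$[\alpha\times\beta]\times\rho = \{\beta\alpha - \alpha\beta\}\cdot\rho,$$

$$\rho\times[\alpha\times\beta] = \rho\cdot\{\beta\alpha - \alpha\beta\}.$$

This is essentially the theorem of No. 27, expressed in a form more
symmetrical, and more easily remembered.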
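
The relations of No. 131 lend themselves to a direct numerical check. In the sketch below, the dyadic formed from the idemfactor and a vector in skew multiplication is assumed to be represented by the matrix whose rows are the skew products of the unit vectors with that vector; the name `skew` and the sample vectors are hypothetical.

```python
from fractions import Fraction

# Illustrative sketch: the dyadic I x a as a matrix, and the identities of No. 131.

def cross(u, v):
    return [u[1] * v[2] - u[2] * v[1],
            u[2] * v[0] - u[0] * v[2],
            u[0] * v[1] - u[1] * v[0]]

def dyad(a, b):
    return [[x * y for y in b] for x in a]

def mul(p, q):
    return [[sum(p[m][k] * q[k][n] for k in range(3)) for n in range(3)]
            for m in range(3)]

def prefactor(phi, r):
    return [sum(phi[m][n] * r[n] for n in range(3)) for m in range(3)]

def postfactor(r, phi):
    return [sum(r[m] * phi[m][n] for m in range(3)) for n in range(3)]

def skew(a):
    # I x a = i (i x a) + j (j x a) + k (k x a)
    return [cross(e, a) for e in ([1, 0, 0], [0, 1, 0], [0, 0, 1])]

alpha, beta, rho = [2, -1, 3], [1, 4, 0], [5, 2, -2]

# {I x a}.r = a x r   and   r.{I x a} = r x a
pre_ok = prefactor(skew(alpha), rho) == cross(alpha, rho)
post_ok = postfactor(rho, skew(alpha)) == cross(rho, alpha)

# I x [a x b] = ba - ab
ba_minus_ab = [[dyad(beta, alpha)[m][n] - dyad(alpha, beta)[m][n]
                for n in range(3)] for m in range(3)]
dyadic_ok = skew(cross(alpha, beta)) == ba_minus_ab

# For a unit vector u, {I x u}^2 = -{I - uu}
u = [Fraction(3, 5), Fraction(4, 5), Fraction(0)]
identity = [[1, 0, 0], [0, 1, 0], [0, 0, 1]]
square = mul(skew(u), skew(u))
expected = [[-(identity[m][n] - u[m] * u[n]) for n in range(3)] for m in range(3)]
print(pre_ok, post_ok, dyadic_ok, square == expected)
```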

132. The equation

$$\alpha\,\beta\times\gamma + \beta\,\gamma\times\alpha + \gamma\,\alpha\times\beta = \alpha\cdot\beta\times\gamma\; I$$

gives, on multiplication by any vector $\rho$, the identical equation

$$\rho\cdot\alpha\;\beta\times\gamma + \rho\cdot\beta\;\gamma\times\alpha + \rho\cdot\gamma\;\alpha\times\beta = \alpha\cdot\beta\times\gamma\;\rho.$$

(See No. 37.) The former equation is therefore identically true.
(See No. 108.) It is a little more general than the equation

$$\alpha\alpha' + \beta\beta' + \gamma\gamma' = I,$$

which we have already considered (No. 124), since, in the form here
given, it is not necessary that $\alpha$, $\beta$, and $\gamma$ should be non-complanar.
We may also write

$$\beta\times\gamma\;\alpha + \gamma\times\alpha\;\beta + \alpha\times\beta\;\gamma = \alpha\cdot\beta\times\gamma\; I.$$

Multiplying this equation by $\rho$ as prefactor (or the first equation by
$\rho$ as postfactor), we obtain

$$\rho\cdot\beta\times\gamma\;\alpha + \rho\cdot\gamma\times\alpha\;\beta + \rho\cdot\alpha\times\beta\;\gamma = \alpha\cdot\beta\times\gamma\;\rho.$$

(Compare No. 37.) For three complanar vectors we have

$$\alpha\,\beta\times\gamma + \beta\,\gamma\times\alpha + \gamma\,\alpha\times\beta = 0.$$

Multiplying this by $\nu$, a unit normal to the plane of $\alpha$, $\beta$, and $\gamma$,
we have

$$\alpha\;\beta\times\gamma\cdot\nu + \beta\;\gamma\times\alpha\cdot\nu + \gamma\;\alpha\times\beta\cdot\nu = 0.$$

This equation expresses the well-known theorem that if the geometrical
sum of three vectors is zero, the magnitude of each vector is
proportional to the sine of the angle between the other two. It also indicates
the numerical coefficients by which one of three complanar vectors may
be expressed in parts of the other two.

133. Def. If two dyadics $\Phi$ and $\Psi$ are such that

$$\Phi\cdot\Psi = \Psi\cdot\Phi,$$

they are said to be homologous.

If any number of dyadics are homologous to one another, and any
other dyadics are formed from them by the operations of taking
multiples, sums, differences, powers, reciprocals, or products, such
dyadics will be homologous to each other and to the original dyadics.
This requires demonstration only in regard to reciprocals. Now if

$$\Phi\cdot\Psi = \Psi\cdot\Phi,$$

$$\Psi\cdot\Phi^{-1} = \Phi^{-1}\cdot\Phi\cdot\Psi\cdot\Phi^{-1} = \Phi^{-1}\cdot\Psi\cdot\Phi\cdot\Phi^{-1} = \Phi^{-1}\cdot\Psi.$$

That is, $\Phi^{-1}$ is homologous to $\Psi$, if $\Phi$ is.

134. If we call $\Psi\cdot\Phi^{-1}$ or $\Phi^{-1}\cdot\Psi$ the quotient of $\Psi$ and $\Phi$, we may
say that the rules of addition, subtraction, multiplication and division
of homologous dyadics are identical with those of arithmetic or
ordinary algebra, except that limitations analogous to those respecting
zero in algebra must be observed with respect to all incomplete
dyadics.

It follows that the algebraic and higher analysis of homologous
dyadics is substantially identical with that of scalars.

135. It is always possible to express a dyadic in three terms, so
that both the antecedents and the consequents shall be perpendicular
among themselves.

To show this for any dyadic $\Phi$, let us set

$$\rho' = \Phi\cdot\rho,$$

$\rho$ being a unit-vector, and consider the different values of $\rho'$ for all
possible directions of $\rho$. Let the direction of the unit vector $i$ be so
determined that when $\rho$ coincides with $i$, the value of $\rho'$ shall be at
least as great as for any other direction of $\rho$. And let the direction
of the unit vector $j$ be so determined that when $\rho$ coincides with $j$,
the value of $\rho'$ shall be at least as great as for any other direction of $\rho$
which is perpendicular to $i$. Let $k$ have its usual position with
respect to $i$ and $j$. It is evidently possible to express $\Phi$ in the form

$$\alpha i + \beta j + \gamma k.$$

We have therefore

$$\rho' = \{\alpha i + \beta j + \gamma k\}\cdot\rho,$$

and

$$d\rho' = \{\alpha i + \beta j + \gamma k\}\cdot d\rho.$$

Now the supposed property of the direction of $i$ requires that when $\rho$
coincides with $i$ and $d\rho$ is perpendicular to $i$, $d\rho'$ shall be perpendicular
to $\rho'$, which will then be parallel to $\alpha$. But if $d\rho$ is parallel to $j$ or $k$,
it will be perpendicular to $i$, and $d\rho'$ will be parallel to $\beta$ or $\gamma$, as the
case may be. Therefore $\beta$ and $\gamma$ are perpendicular to $\alpha$. In the same
way it may be shown that the condition relative to $j$ requires that $\gamma$
shall be perpendicular to $\beta$. We may therefore set

$$\Phi = a\,i'i + b\,j'j + c\,k'k,$$

where $i'$, $j'$, $k'$, like $i$, $j$, $k$, constitute a normal system of unit vectors
(see No. 11), and $a$, $b$, $c$ are scalars which may be either positive or
negative.

It makes an important difference whether the number of these
scalars which are negative is even or odd. If two are negative, say $a$
and $b$, we may make them positive by reversing the directions of $i'$
and $j'$. The vectors $i'$, $j'$, $k'$ will still constitute a normal system.
But if we should reverse the directions of an odd number of these
vectors, they would cease to constitute a normal system, and to be
superposable upon the system $i$, $j$, $k$. We may, however, always set

$$\Phi = a\,i'i + b\,j'j + c\,k'k,$$

or

$$\Phi = -\{a\,i'i + b\,j'j + c\,k'k\},$$

with positive values of $a$, $b$, and $c$. At the limit between these cases
are the planar dyadics, in which one of the three terms vanishes, and
the dyadic reduces to the form

$$a\,i'i + b\,j'j,$$

in which $a$ and $b$ may always be made positive by giving the proper
directions to $i'$ and $j'$.

If the numerical values of $a$, $b$, $c$ are all unequal, there will be only
one way in which the value of $\Phi$ may be thus expressed. If they are
not all unequal, there will be an infinite number of ways in which $\Phi$
may be thus expressed, in all of which the three scalar coefficients
will have the same values with exception of the changes of signs
mentioned above. If the three values are numerically identical, we
may give to either system of normal vectors an arbitrary position.

136. It follows that any self-conjugate dyadic may be expressed
in the form

$$a\,ii + b\,jj + c\,kk,$$

where $i$, $j$, $k$ are a normal system of unit vectors, and $a$, $b$, $c$ are
positive or negative scalars.

137. Any dyadic may be divided into two parts, of which one shall
be self-conjugate, and the other of the form $I\times\alpha$. These parts are
found by taking half the sum and half the difference of the dyadic
and its conjugate. It is evident that

$$\Phi = \tfrac{1}{2}\{\Phi + \Phi_C\} + \tfrac{1}{2}\{\Phi - \Phi_C\}.$$

Now $\tfrac{1}{2}\{\Phi + \Phi_C\}$ is self-conjugate, and

$$\tfrac{1}{2}\{\Phi - \Phi_C\} = I\times\left[-\tfrac{1}{2}\Phi_\times\right].$$

(See No. 131.)
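
A short numerical illustration of No. 137 follows. It assumes the matrix model of a dyadic, in which the conjugate is the transposed array; the sample coefficients and helper names are hypothetical. The skew part is compared with the dyadic formed from the idemfactor and minus half the vector of the dyadic.

```python
from fractions import Fraction

# Illustrative sketch: self-conjugate part plus a part of the form I x a.

def cross(u, v):
    return [u[1] * v[2] - u[2] * v[1],
            u[2] * v[0] - u[0] * v[2],
            u[0] * v[1] - u[1] * v[0]]

def skew(a):
    # Matrix of the dyadic I x a
    return [cross(e, a) for e in ([1, 0, 0], [0, 1, 0], [0, 0, 1])]

def conjugate(phi):
    return [[phi[n][m] for n in range(3)] for m in range(3)]

def vector_of(phi):
    # The vector obtained by inserting the sign of skew multiplication
    return [phi[1][2] - phi[2][1], phi[2][0] - phi[0][2], phi[0][1] - phi[1][0]]

phi = [[2, 7, -1], [3, 0, 4], [5, -2, 1]]  # arbitrary sample coefficients
phi_c = conjugate(phi)
half = Fraction(1, 2)

self_conj = [[half * (phi[m][n] + phi_c[m][n]) for n in range(3)] for m in range(3)]
skew_part = [[half * (phi[m][n] - phi_c[m][n]) for n in range(3)] for m in range(3)]

a = [-half * x for x in vector_of(phi)]   # minus half the vector of the dyadic
print(skew_part == skew(a), self_conj == conjugate(self_conj))
```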

Rotations and Strains. 

138. To illustrate the use of dyadics as operators, let us suppose
that a body receives such a displacement that

$$\rho' = \Phi\cdot\rho,$$

$\rho$ and $\rho'$ being the position-vectors of the same point of the body in
its initial and subsequent positions. The same relation will hold of
the vectors which unite any two points of the body in their initial
and subsequent positions. For if $\rho_1$, $\rho_2$ are the original
position-vectors of the points, and $\rho_1'$, $\rho_2'$ their final position-vectors, we have

$$\rho_1' = \Phi\cdot\rho_1, \qquad \rho_2' = \Phi\cdot\rho_2,$$

whence

$$\rho_2' - \rho_1' = \Phi\cdot[\rho_2 - \rho_1].$$

In the most general case, the body is said to receive a homogeneous 
strain. In special cases, the displacement reduces to a rotation. 
Lines in the body initially straight and parallel will be straight and 
parallel after the displacement, and surfaces initially plane and 
parallel will be plane and parallel after the displacement. 

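139. The vectors ($\sigma$, $\sigma'$) which represent any plane surface in the
body in its initial and final positions will be linear functions of each
other. (This will appear, if we consider the four sides of a
tetrahedron in the body.) To find the relation of the dyadics which
express $\sigma'$ as a function of $\sigma$, and $\rho'$ as a function of $\rho$, let

$$\rho' = \{\alpha\lambda + \beta\mu + \gamma\nu\}\cdot\rho.$$

Then, if we write $\lambda'$, $\mu'$, $\nu'$ for the reciprocals of $\lambda$, $\mu$, $\nu$, the vectors
$\lambda'$, $\mu'$, $\nu'$ become by the strain $\alpha$, $\beta$, $\gamma$. Therefore the surfaces $\mu'\times\nu'$,
$\nu'\times\lambda'$, $\lambda'\times\mu'$ become $\beta\times\gamma$, $\gamma\times\alpha$, $\alpha\times\beta$. But $\mu'\times\nu'$, $\nu'\times\lambda'$, $\lambda'\times\mu'$ are the
reciprocals of $\mu\times\nu$, $\nu\times\lambda$, $\lambda\times\mu$. The relation sought is therefore

$$\sigma' = \{\beta\times\gamma\;\mu\times\nu + \gamma\times\alpha\;\nu\times\lambda + \alpha\times\beta\;\lambda\times\mu\}\cdot\sigma.$$

140. The volume $\lambda'\cdot\mu'\times\nu'$ becomes by the strain $\alpha\cdot\beta\times\gamma$. The
unit of volume becomes therefore $(\alpha\cdot\beta\times\gamma)(\lambda\cdot\mu\times\nu)$.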

Def. It follows that the scalar product of the three antecedents
multiplied by the scalar product of the three consequents of a dyadic
expressed as a trinomial is independent of the particular form in
which the dyadic is thus expressed. This quantity is the determinant
of the coefficients of the nine terms of the form

$$a\,ii + b\,ij + \text{etc.},$$

into which the dyadic may be expanded. We shall call it the
determinant of the dyadic, and shall denote it by the notation

$$|\Phi|$$

when the dyadic is expressed by a single letter.

If a dyadic is incomplete, its determinant is zero, and conversely. 
The determinant of the product of any number of dyadics is 
equal to the product of their determinants. The determinant of the 
reciprocal of a dyadic is the reciprocal of the determinant of that 
dyadic. The determinants of a dyadic and its conjugate are equal. 
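
The statements of No. 140 about the determinant may be illustrated as follows. The sketch assumes the matrix model of a dyadic; `det3` and the sample vectors are hypothetical names and values. It compares the determinant of the nine coefficients with the product of the two scalar triple products, and checks the rules for products and conjugates on one example.

```python
# Illustrative sketch: determinant of a dyadic written as a trinomial.

def dot(u, v):
    return sum(a * b for a, b in zip(u, v))

def cross(u, v):
    return [u[1] * v[2] - u[2] * v[1],
            u[2] * v[0] - u[0] * v[2],
            u[0] * v[1] - u[1] * v[0]]

def dyad(a, b):
    return [[x * y for y in b] for x in a]

def add(*dyadics):
    return [[sum(d[m][n] for d in dyadics) for n in range(3)] for m in range(3)]

def mul(p, q):
    return [[sum(p[m][k] * q[k][n] for k in range(3)) for n in range(3)]
            for m in range(3)]

def det3(m):
    # Determinant of the nine coefficients
    return (m[0][0] * (m[1][1] * m[2][2] - m[1][2] * m[2][1])
            - m[0][1] * (m[1][0] * m[2][2] - m[1][2] * m[2][0])
            + m[0][2] * (m[1][0] * m[2][1] - m[1][1] * m[2][0]))

alpha, beta, gamma = [1, 0, 1], [0, 2, 1], [1, 1, 0]
lam, mu, nu = [1, 1, 0], [0, 1, 1], [2, 0, 1]

phi = add(dyad(alpha, lam), dyad(beta, mu), dyad(gamma, nu))
triple_product = dot(alpha, cross(beta, gamma)) * dot(lam, cross(mu, nu))

psi = [[1, 2, 0], [0, 1, 1], [3, 0, 1]]          # a second sample dyadic
phi_c = [[phi[n][m] for n in range(3)] for m in range(3)]  # conjugate of phi

print(det3(phi), triple_product)
```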

The relation of the surfaces $\sigma'$ and $\sigma$ may be expressed by the equation

$$\sigma' = |\Phi|\;\Phi_C^{-1}\cdot\sigma.$$

141. Let us now consider the different cases of rotation and strain
as determined by the nature of the dyadic $\Phi$.

If $\Phi$ is reducible to the form

$$i'i + j'j + k'k,$$

$i$, $j$, $k$, $i'$, $j'$, $k'$ being normal systems of unit vectors (see No. 11), the
body will suffer no change of form. For if

$$\rho = x\,i + y\,j + z\,k,$$

we shall have

$$\rho' = x\,i' + y\,j' + z\,k'.$$

Conversely, if the body suffers no change of form, the operating
dyadic is reducible to the above form. In such cases, it appears from
simple geometrical considerations that the displacement of the body
may be produced by a rotation about a certain axis. A dyadic
reducible to the form

$$i'i + j'j + k'k$$

may therefore be called a versor.

142. The conjugate operator evidently produces the reverse rotation.
A versor, therefore, is the reciprocal of its conjugate.

Conversely, if a dyadic is the reciprocal of its conjugate, it is either
a versor, or a versor multiplied by $-1$. For the dyadic may be
expressed in the form

$$\alpha i + \beta j + \gamma k.$$

Its conjugate will be

$$i\alpha + j\beta + k\gamma.$$

If these are reciprocals, we have

$$\{\alpha i + \beta j + \gamma k\}\cdot\{i\alpha + j\beta + k\gamma\} = \alpha\alpha + \beta\beta + \gamma\gamma = I.$$

But this relation cannot subsist unless $\alpha$, $\beta$, $\gamma$ are reciprocals to
themselves, i.e., unless they are mutually perpendicular unit-vectors.
Therefore, they either are a normal system of unit-vectors, or will
become such if their directions are reversed. Therefore, one of the
dyadics

$$\alpha i + \beta j + \gamma k \quad\text{and}\quad -\alpha i - \beta j - \gamma k$$

is a versor.


[See note on p. 90.] 
The criterion of a versor may therefore be written

$$\Phi\cdot\Phi_C = I, \quad\text{and}\quad |\Phi| = 1.$$

For the last equation we may substitute

$$|\Phi| > 0, \quad\text{or}\quad |\Phi| \neq -1.$$

It is evident that the resultant of successive finite rotations is
obtained by multiplication of the versors.

143. If we take the axis of the rotation for the direction of $i$,
$i'$ will have the same direction, and the versor reduces to the form

$$ii + j'j + k'k,$$

in which $i$, $j$, $k$ and $i$, $j'$, $k'$ are normal systems of unit vectors.

We may set

$$j' = \cos q\; j + \sin q\; k,$$

$$k' = \cos q\; k - \sin q\; j,$$

and the versor reduces to

$$ii + \cos q\,\{jj + kk\} + \sin q\,\{kj - jk\},$$

or

$$ii + \cos q\,\{I - ii\} + \sin q\; I\times i,$$

where $q$ is the angle of rotation, measured from $j$ toward $k$, if the
versor is used as a prefactor.

144. When any versor $\Phi$ is used as a prefactor, the vector $-\Phi_\times$
will be parallel to the axis of rotation, and equal in magnitude to
twice the sine of the angle of rotation measured counter-clockwise as
seen from the direction in which the vector points. (This will appear
if we suppose $\Phi$ to be represented in the form given in the last
paragraph.) The scalar $\Phi_S$ will be equal to unity increased by twice
the cosine of the same angle. Together, $-\Phi_\times$ and $\Phi_S$ determine the
versor without ambiguity. If we set

$$\theta = \frac{-\Phi_\times}{1 + \Phi_S},$$

the magnitude of $\theta$ will be

$$\frac{2\sin q}{2 + 2\cos q}, \quad\text{or}\quad \tan\tfrac{1}{2}q,$$

where $q$ is measured counter-clockwise as seen from the direction in
which $\theta$ points. This vector $\theta$, which we may call the vector
semitangent of version, determines the versor without ambiguity.
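
As a hedged numerical illustration of No. 144, the sketch below takes the versor of No. 143 for a sample angle, forms its vector and scalar from the nine coefficients (assuming the matrix model of a dyadic), and computes the vector semitangent of version. The angle value and the helper names are assumptions of the example.

```python
import math

# Illustrative sketch: vector semitangent of version for a rotation about i.

def vector_of(phi):
    return [phi[1][2] - phi[2][1], phi[2][0] - phi[0][2], phi[0][1] - phi[1][0]]

def scalar_of(phi):
    return phi[0][0] + phi[1][1] + phi[2][2]

q = 0.7  # sample angle of rotation in radians
c, s = math.cos(q), math.sin(q)

# ii + cos q {jj + kk} + sin q {kj - jk}
phi = [[1.0, 0.0, 0.0],
       [0.0, c, -s],
       [0.0, s, c]]

minus_vec = [-x for x in vector_of(phi)]          # parallel to the axis i
theta = [x / (1.0 + scalar_of(phi)) for x in minus_vec]
print(minus_vec, theta, math.tan(q / 2))
```

The first component of `minus_vec` is twice the sine of the angle, and `theta` points along the axis with magnitude equal to the tangent of half the angle.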

145. The versor $\Phi$ may be expressed in terms of $\theta$ in various ways.
Since $\Phi$ (as prefactor) changes $\alpha - \theta\times\alpha$ into $\alpha + \theta\times\alpha$ ($\alpha$ being any
vector), we have

$$\Phi = \{I + I\times\theta\}\cdot\{I - I\times\theta\}^{-1}.$$

Again

$$\Phi = \frac{\theta\theta + \{I + I\times\theta\}^{2}}{1 + \theta\cdot\theta} = \frac{(1 - \theta\cdot\theta)\,I + 2\,\theta\theta + 2\,I\times\theta}{1 + \theta\cdot\theta},$$

as will be evident on considering separately in the expression $\Phi\cdot\rho$
the components perpendicular and parallel to $\theta$, or on substituting in

$$ii + \cos q\,(jj + kk) + \sin q\,(kj - jk)$$

for $\cos q$ and $\sin q$ their values in terms of $\tan\tfrac{1}{2}q$.

If we set, in either of these equations,

$$\theta = a\,i + b\,j + c\,k,$$

we obtain, on reduction, the formula

$$\Phi = \frac{\left\{\begin{aligned} & (1 + a^2 - b^2 - c^2)\,ii + (2ab - 2c)\,ij + (2ac + 2b)\,ik \\ +\; & (2ab + 2c)\,ji + (1 - a^2 + b^2 - c^2)\,jj + (2bc - 2a)\,jk \\ +\; & (2ac - 2b)\,ki + (2bc + 2a)\,kj + (1 - a^2 - b^2 + c^2)\,kk \end{aligned}\right\}}{1 + a^2 + b^2 + c^2},$$

in which the versor is expressed in terms of the rectangular
components of the vector semitangent of version.
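
The formula of No. 145 can be exercised directly. The sketch below builds the versor from a sample vector semitangent of version using exact rational arithmetic, then checks the criterion of a versor and recovers the semitangent again. The matrix model of a dyadic, the function name `versor`, and the sample components are assumptions of the example.

```python
from fractions import Fraction

# Illustrative sketch: versor from its vector semitangent of version.

def dot(u, v):
    return sum(a * b for a, b in zip(u, v))

def cross(u, v):
    return [u[1] * v[2] - u[2] * v[1],
            u[2] * v[0] - u[0] * v[2],
            u[0] * v[1] - u[1] * v[0]]

def skew(a):
    # Matrix of the dyadic I x a
    return [cross(e, a) for e in ([1, 0, 0], [0, 1, 0], [0, 0, 1])]

def mul(p, q):
    return [[sum(p[m][k] * q[k][n] for k in range(3)) for n in range(3)]
            for m in range(3)]

IDENTITY = [[1, 0, 0], [0, 1, 0], [0, 0, 1]]

def versor(theta):
    # {(1 - t.t) I + 2 tt + 2 I x t} / (1 + t.t)
    t2 = dot(theta, theta)
    s = skew(theta)
    return [[((1 - t2) * IDENTITY[m][n] + 2 * theta[m] * theta[n] + 2 * s[m][n])
             / (1 + t2) for n in range(3)] for m in range(3)]

a, b, c = Fraction(1, 2), Fraction(1, 3), Fraction(1, 4)
theta = [a, b, c]
phi = versor(theta)
phi_c = [[phi[n][m] for n in range(3)] for m in range(3)]

# Recover the semitangent from the vector and the scalar of the versor.
vec = [phi[1][2] - phi[2][1], phi[2][0] - phi[0][2], phi[0][1] - phi[1][0]]
scal = phi[0][0] + phi[1][1] + phi[2][2]
recovered = [-x / (1 + scal) for x in vec]

print(mul(phi, phi_c) == IDENTITY, recovered == theta)
```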

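146. If $\alpha$, $\beta$, $\gamma$ are unit vectors, expressions of the form

$$2\alpha\alpha - I, \qquad 2\beta\beta - I, \qquad 2\gamma\gamma - I,$$

are biquadrantal versors. A product like

$$\{2\beta\beta - I\}\cdot\{2\alpha\alpha - I\}$$

is a versor of which the axis is perpendicular to $\alpha$ and $\beta$, and the
amount of rotation twice that which would carry $\alpha$ to $\beta$. It is
evident that any versor may be thus expressed, and that either $\alpha$ or $\beta$
may be given any direction perpendicular to the axis of rotation. If

$$\Phi = \{2\beta\beta - I\}\cdot\{2\alpha\alpha - I\}, \quad\text{and}\quad \Psi = \{2\gamma\gamma - I\}\cdot\{2\beta\beta - I\},$$

we have for the resultant of the successive rotations

$$\Psi\cdot\Phi = \{2\gamma\gamma - I\}\cdot\{2\alpha\alpha - I\}.$$
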

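This may be applied to the composition of any two successive
rotations, $\beta$ being taken perpendicular to the two axes of rotation,
and affords the means of determining the resultant rotation by
construction on the surface of a sphere. It also furnishes a simple
method of finding the relations of the vector semitangents of version
for the versors $\Phi$, $\Psi$, and $\Psi\cdot\Phi$. Let

$$\theta_1 = \frac{-\Phi_\times}{1 + \Phi_S}, \qquad \theta_2 = \frac{-\Psi_\times}{1 + \Psi_S}, \qquad \theta_3 = \frac{-\{\Psi\cdot\Phi\}_\times}{1 + \{\Psi\cdot\Phi\}_S}.$$

Then, since

$$\Phi = 4\,\alpha\cdot\beta\;\beta\alpha - 2\alpha\alpha - 2\beta\beta + I,$$

$$\theta_1 = \frac{\alpha\times\beta}{\alpha\cdot\beta},$$

which is moreover geometrically evident. In like manner,

$$\theta_2 = \frac{\beta\times\gamma}{\beta\cdot\gamma}, \qquad \theta_3 = \frac{\alpha\times\gamma}{\alpha\cdot\gamma}.$$

Therefore,

$$\theta_1\times\theta_2 = \frac{[\alpha\times\beta]\times[\beta\times\gamma]}{\alpha\cdot\beta\;\beta\cdot\gamma} = \frac{\alpha\cdot\beta\times\gamma\;\beta}{\alpha\cdot\beta\;\beta\cdot\gamma} = \frac{\beta\cdot\alpha\;\beta\times\gamma + \beta\cdot\beta\;\gamma\times\alpha + \beta\cdot\gamma\;\alpha\times\beta}{\alpha\cdot\beta\;\beta\cdot\gamma}.$$

(See No. 38.)

Also,

$$\theta_1\cdot\theta_2 = \frac{[\alpha\times\beta]\cdot[\beta\times\gamma]}{\alpha\cdot\beta\;\beta\cdot\gamma} = \frac{\alpha\cdot\beta\;\beta\cdot\gamma - \alpha\cdot\gamma\;\beta\cdot\beta}{\alpha\cdot\beta\;\beta\cdot\gamma} = 1 - \frac{\alpha\cdot\gamma}{\alpha\cdot\beta\;\beta\cdot\gamma}.$$

Hence,

$$\theta_1\times\theta_2 = \frac{\beta\times\gamma}{\beta\cdot\gamma} - \frac{\alpha\cdot\gamma}{\alpha\cdot\beta\;\beta\cdot\gamma}\,\frac{\alpha\times\gamma}{\alpha\cdot\gamma} + \frac{\alpha\times\beta}{\alpha\cdot\beta}.$$

That is,

$$\theta_1\times\theta_2 = \theta_2 - (1 - \theta_1\cdot\theta_2)\,\theta_3 + \theta_1,$$

$$\theta_3 = \frac{\theta_1 + \theta_2 + \theta_2\times\theta_1}{1 - \theta_1\cdot\theta_2},$$

which is the formula for the composition of successive finite rotations
by means of their vector semitangents of version.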
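
The composition formula can be tested against direct multiplication of versors. The sketch below assumes the expression of a versor in terms of its vector semitangent given in No. 145 and the matrix model of a dyadic; the function names `versor` and `compose` and the sample semitangents are hypothetical. Exact rational arithmetic makes the comparison exact.

```python
from fractions import Fraction

# Illustrative sketch: composing rotations by their vector semitangents.

def dot(u, v):
    return sum(a * b for a, b in zip(u, v))

def cross(u, v):
    return [u[1] * v[2] - u[2] * v[1],
            u[2] * v[0] - u[0] * v[2],
            u[0] * v[1] - u[1] * v[0]]

def mul(p, q):
    return [[sum(p[m][k] * q[k][n] for k in range(3)) for n in range(3)]
            for m in range(3)]

def versor(theta):
    # {(1 - t.t) I + 2 tt + 2 I x t} / (1 + t.t), as in No. 145
    t2 = dot(theta, theta)
    unit = ([1, 0, 0], [0, 1, 0], [0, 0, 1])
    s = [cross(e, theta) for e in unit]
    return [[((1 - t2) * unit[m][n] + 2 * theta[m] * theta[n] + 2 * s[m][n])
             / (1 + t2) for n in range(3)] for m in range(3)]

def compose(t1, t2):
    # Semitangent of the resultant when the rotation t1 is followed by t2
    c = cross(t2, t1)
    d = 1 - dot(t1, t2)
    return [(t1[n] + t2[n] + c[n]) / d for n in range(3)]

theta1 = [Fraction(1, 2), Fraction(0), Fraction(1, 3)]
theta2 = [Fraction(0), Fraction(1, 4), Fraction(1, 5)]
theta3 = compose(theta1, theta2)

# The resultant versor is the second versor multiplied into the first.
resultant = mul(versor(theta2), versor(theta1))
print(theta3, versor(theta3) == resultant)
```

The order of the factors matters: exchanging `theta1` and `theta2` in `compose` changes the sign of the skew product and, in general, the resultant.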

147. The versors just described constitute a particular class under
the more general form

$$\alpha\alpha' + \cos q\,\{\beta\beta' + \gamma\gamma'\} + \sin q\,\{\gamma\beta' - \beta\gamma'\},$$

in which $\alpha$, $\beta$, $\gamma$ are any non-complanar vectors, and $\alpha'$, $\beta'$, $\gamma'$ their
reciprocals. A dyadic of this form as a prefactor does not affect any
vector parallel to $\alpha$. Its effect on a vector in the $\beta$-$\gamma$ plane will be
best understood if we imagine an ellipse to be described of which
$\beta$ and $\gamma$ are conjugate semi-diameters. If the vector to be operated
on be a radius of this ellipse, we may evidently regard the ellipse
with $\beta$, $\gamma$, and the other vector, as the projections of a circle with two
perpendicular radii and one other radius. A little consideration will
show that if the third radius of the circle is advanced an angle $q$, its
projection in the ellipse will be advanced as required by the dyadic
prefactor. The effect, therefore, of such a prefactor on a vector in the
$\beta$-$\gamma$ plane may be obtained as follows: Describe an ellipse of which
$\beta$ and $\gamma$ are conjugate semi-diameters. Then describe a similar and
similarly placed ellipse of which the vector to be operated on is a
radius. The effect of the operator is to advance the radius in this
ellipse, in the angular direction from $\beta$ toward $\gamma$, over a segment
which is to the total area of the ellipse as $q$ is to $2\pi$. When used as
a postfactor, the properties of the dyadic are similar, but the axis of
no motion and the planes of rotation are in general different.

Def. Such dyadics we shall call cyclic.

The $N$th power ($N$ being any whole number) of such a dyadic is
obtained by multiplying $q$ by $N$. If $q$ is of the form $2\pi N/M$ ($N$ and
$M$ being any whole numbers) the $M$th power of the dyadic will be an
idemfactor. A cyclic dyadic, therefore, may be regarded as a root of
$I$, or at least capable of expression with any required degree of
accuracy as a root of $I$.
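
The behaviour of the powers of a cyclic dyadic may be checked in floating point. The sketch below assumes the matrix model of a dyadic and the usual reciprocal system of three non-complanar vectors; the sample vectors, the choice of five for the whole number M, and the helper names are hypothetical.

```python
import math

# Illustrative sketch: powers of a cyclic dyadic built on oblique vectors.

def dot(u, v):
    return sum(a * b for a, b in zip(u, v))

def cross(u, v):
    return [u[1] * v[2] - u[2] * v[1],
            u[2] * v[0] - u[0] * v[2],
            u[0] * v[1] - u[1] * v[0]]

def dyad(a, b):
    return [[x * y for y in b] for x in a]

def mul(p, q):
    return [[sum(p[m][k] * q[k][n] for k in range(3)) for n in range(3)]
            for m in range(3)]

def reciprocals(a, b, c):
    vol = dot(a, cross(b, c))
    return ([x / vol for x in cross(b, c)],
            [x / vol for x in cross(c, a)],
            [x / vol for x in cross(a, b)])

alpha, beta, gamma = [1.0, 0.0, 0.5], [0.0, 2.0, 1.0], [1.0, 1.0, 0.0]
ra, rb, rg = reciprocals(alpha, beta, gamma)

def cyclic(q):
    # aa' + cos q {bb' + cc'} + sin q {cb' - bc'}
    c, s = math.cos(q), math.sin(q)
    aa, bb, cc = dyad(alpha, ra), dyad(beta, rb), dyad(gamma, rg)
    cb, bc = dyad(gamma, rb), dyad(beta, rg)
    return [[aa[m][n] + c * (bb[m][n] + cc[m][n]) + s * (cb[m][n] - bc[m][n])
             for n in range(3)] for m in range(3)]

def close(p, q, tol=1e-9):
    return all(abs(p[m][n] - q[m][n]) < tol for m in range(3) for n in range(3))

M = 5
root = cyclic(2 * math.pi / M)
power = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]]
for _ in range(M):
    power = mul(power, root)

identity = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]]
print(close(power, identity), close(mul(cyclic(0.3), cyclic(0.3)), cyclic(0.6)))
```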

It should be observed that the value of the above dyadic will not
be altered by the substitution for $\alpha$ of any other parallel vector, or
for $\beta$ and $\gamma$ of any other conjugate semi-diameters (which succeed one
another in the same angular direction) of the same or any similar and
similarly situated ellipse, with the changes which these substitutions
require in the values of $\alpha'$, $\beta'$, $\gamma'$. Or, to consider the same changes
from another point of view, the value of the dyadic will not be altered
by the substitution for $\alpha'$ of any other parallel vector or for $\beta'$ and $\gamma'$
of any other conjugate semi-diameters (which succeed one another in
the same angular direction) of the same or any similar and similarly
situated ellipse, with the changes which these substitutions require in
the values of $\alpha$, $\beta$, and $\gamma$, defined as reciprocals of $\alpha'$, $\beta'$, $\gamma'$.

148. The strain represented by the equation

$$\rho' = \{a\,ii + b\,jj + c\,kk\}\cdot\rho,$$

where $a$, $b$, $c$ are positive scalars, may be described as consisting of
three elongations (or contractions) parallel to the axes $i$, $j$, $k$, which
are called the principal axes of the strain, and which have the
property that their directions are not affected by the strain. The scalars
$a$, $b$, $c$ are called the principal ratios of elongation. (When one of
these is less than unity, it represents a contraction.) The order of
the three elongations is immaterial, since the original dyadic is equal
to the product of the three dyadics

$$a\,ii + jj + kk, \qquad ii + b\,jj + kk, \qquad ii + jj + c\,kk$$

taken in any order.

Def. A dyadic which is reducible to this form we shall call a right
tensor. The displacement represented by a right tensor is called a
pure strain. A right tensor is evidently self-conjugate.

149. We have seen (No. 135) that every dyadic may be expressed
in the form

$$\pm\{a\,i'i + b\,j'j + c\,k'k\},$$

where $a$, $b$, $c$ are positive scalars. This is equivalent to

$$\pm\{a\,i'i' + b\,j'j' + c\,k'k'\}\cdot\{i'i + j'j + k'k\}$$

and to

$$\pm\{i'i + j'j + k'k\}\cdot\{a\,ii + b\,jj + c\,kk\}.$$

Hence every dyadic may be expressed as the product of a versor and
a right tensor with the scalar factor $\pm 1$. The versor may precede or
follow. It will be the same versor in either case, and the ratios of
elongation will be the same; but the position of the principal axes
of the tensor will differ in the two cases, either system being derived
from the other by multiplication by the versor.

Def. The displacement represented by the equation

$$\rho' = -\rho$$

is called inversion. The most general case of a homogeneous strain
may therefore be produced by a pure strain and a rotation with or
without inversion.

150. If

$$\Phi = a\,i'i + b\,j'j + c\,k'k,$$

$$\Phi\cdot\Phi_C = a^{2}\,i'i' + b^{2}\,j'j' + c^{2}\,k'k',$$

and

$$\Phi_C\cdot\Phi = a^{2}\,ii + b^{2}\,jj + c^{2}\,kk.$$

The general problem of the determination of the principal ratios and
axes of strain for a given dyadic may thus be reduced to the case of
a right tensor.

151. Def. The effect of a prefactor of the form

$$a\,\alpha\alpha' + b\,\beta\beta' + c\,\gamma\gamma',$$

where $a$, $b$, $c$ are positive or negative scalars, $\alpha$, $\beta$, $\gamma$ non-complanar
vectors, and $\alpha'$, $\beta'$, $\gamma'$ their reciprocals, is to change $\alpha$ into $a\alpha$, $\beta$ into
$b\beta$, and $\gamma$ into $c\gamma$. As a postfactor, the same dyadic will change $\alpha'$
into $a\alpha'$, $\beta'$ into $b\beta'$, and $\gamma'$ into $c\gamma'$. Dyadics which can be reduced to
this form we shall call tonic (Gr. τείνω). The right tensor already
described constitutes a particular case, distinguished by perpendicular
axes and positive values of the coefficients $a$, $b$, $c$.

The value of the dyadic is evidently not affected by substituting
vectors of different lengths but the same or opposite directions for
$\alpha$, $\beta$, $\gamma$, with the necessary changes in the values of $\alpha'$, $\beta'$, $\gamma'$, defined
as reciprocals of $\alpha$, $\beta$, $\gamma$. But, except this change, if $a$, $b$, $c$ are
unequal, the dyadic can be expressed only in one way in the above
form. If, however, two of these coefficients are equal, say $a$ and $b$,
any two non-collinear vectors in the $\alpha$-$\beta$ plane may be substituted
for $\alpha$ and $\beta$, or, if the three coefficients are equal, any three
non-complanar vectors may be substituted for $\alpha$, $\beta$, $\gamma$.

152. Tonics having the same axes (determined by the directions
of $\alpha$, $\beta$, $\gamma$) are homologous, and their multiplication is
effected by multiplying their coefficients. Thus,

$$\{a_1\alpha\alpha' + b_1\beta\beta' + c_1\gamma\gamma'\}\cdot\{a_2\alpha\alpha' + b_2\beta\beta' + c_2\gamma\gamma'\}$$

$$= \{a_1a_2\,\alpha\alpha' + b_1b_2\,\beta\beta' + c_1c_2\,\gamma\gamma'\}.$$

Hence, division of such dyadics is effected by division of their
coefficients. A tonic of which the three coefficients $a$, $b$, $c$ are
unequal, is homologous only with such dyadics as can be obtained by
varying the coefficients.
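
The rule of No. 152 can be checked numerically. The sketch below is only an illustration, not part of the original text: it assumes that a dyad uv is represented by the 3x3 matrix which, as a prefactor, sends r to u(v.r), and the helper names (`reciprocals`, `dyad`, `tonic`, and so on) are hypothetical.

```python
def cross(u, v):
    return [u[1] * v[2] - u[2] * v[1],
            u[2] * v[0] - u[0] * v[2],
            u[0] * v[1] - u[1] * v[0]]

def dot(u, v):
    return sum(x * y for x, y in zip(u, v))

def reciprocals(a, b, c):
    """Reciprocal system a', b', c' of three non-complanar vectors."""
    vol = dot(a, cross(b, c))
    return tuple([x / vol for x in w]
                 for w in (cross(b, c), cross(c, a), cross(a, b)))

def dyad(u, v):
    """Matrix of the dyad uv, which as a prefactor sends r to u (v.r)."""
    return [[x * y for y in v] for x in u]

def combine(terms):
    """Sum of scalar multiples of 3x3 matrices, terms = [(s, M), ...]."""
    return [[sum(s * m[i][j] for s, m in terms) for j in range(3)]
            for i in range(3)]

def matmul(p, q):
    """Direct product of two dyadics given as matrices."""
    return [[sum(p[i][k] * q[k][j] for k in range(3)) for j in range(3)]
            for i in range(3)]

def tonic(coeffs, basis):
    """a aa' + b bb' + c cc' for the axes in `basis` (oblique axes allowed)."""
    recip = reciprocals(*basis)
    return combine([(s, dyad(v, w)) for s, v, w in zip(coeffs, basis, recip)])

def close(m, n, tol=1e-9):
    return all(abs(m[i][j] - n[i][j]) < tol
               for i in range(3) for j in range(3))

# Two tonics on the same oblique axes multiply coefficient by coefficient.
AXES = ([1.0, 0.0, 0.0], [1.0, 2.0, 0.0], [0.5, -1.0, 3.0])
PRODUCT = matmul(tonic((2.0, -3.0, 0.5), AXES), tonic((4.0, 1.5, -2.0), AXES))
EXPECTED = tonic((8.0, -4.5, -1.0), AXES)
```

Here `close(PRODUCT, EXPECTED)` holds, and the same is true with the factors in the opposite order, as homologous dyadics require.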

153. The effect of a prefactor of the form

$$a\,\alpha\alpha' + b\{\beta\beta' + \gamma\gamma'\} + c\{\gamma\beta' - \beta\gamma'\},$$

or $$a\,\alpha\alpha' + p\cos q\,\{\beta\beta' + \gamma\gamma'\} + p\sin q\,\{\gamma\beta' - \beta\gamma'\},$$

where $\alpha'$, $\beta'$, $\gamma'$ are the reciprocals of $\alpha$, $\beta$, $\gamma$, and
$a$, $b$, $c$, $p$, and $q$ are scalars, of which $p$ is positive, will be
most evident if we resolve it into the factors

$$a\,\alpha\alpha' + \beta\beta' + \gamma\gamma',$$

$$\alpha\alpha' + p\,\beta\beta' + p\,\gamma\gamma',$$

$$\alpha\alpha' + \cos q\,\{\beta\beta' + \gamma\gamma'\} + \sin q\,\{\gamma\beta' - \beta\gamma'\},$$

of which the order is immaterial, and if we suppose the vector on
which we operate to be resolved into two components, one parallel to
$\alpha$, and the other in the $\beta$-$\gamma$ plane. The effect of the first
factor is to multiply by $a$ the component parallel to $\alpha$, without
affecting the other. The effect of the second is to multiply by $p$ the
component in the $\beta$-$\gamma$ plane without affecting the other. The
effect of the third is to give the component in the $\beta$-$\gamma$ plane the
kind of elliptic rotation described in No. 147.

The effect of the same dyadic as a postfactor is of the same nature. 

The value of the dyadic is not affected by the substitution for $\alpha$ of
another vector having the same direction, nor by the substitution for
$\beta$ and $\gamma$ of two other conjugate semi-diameters of the same or a
similar and similarly situated ellipse, and which follow one another
in the same angular direction.

Def.—Such dyadics we shall call cyclotonic.

154. Cyclotonics which are reducible to the same form except
with respect to the values of $a$, $p$, and $q$ are homologous. They are
multiplied by multiplying the values of $a$, and also those of $p$, and
adding those of $q$. Thus, the product of

$$a_1\alpha\alpha' + p_1\cos q_1\{\beta\beta' + \gamma\gamma'\} + p_1\sin q_1\{\gamma\beta' - \beta\gamma'\}$$

and $$a_2\alpha\alpha' + p_2\cos q_2\{\beta\beta' + \gamma\gamma'\} + p_2\sin q_2\{\gamma\beta' - \beta\gamma'\}$$

is $$a_1a_2\,\alpha\alpha' + p_1p_2\cos(q_1+q_2)\{\beta\beta' + \gamma\gamma'\} + p_1p_2\sin(q_1+q_2)\{\gamma\beta' - \beta\gamma'\}.$$

A dyadic of this form, in which the value of $q$ is not zero, or the
product of $\pi$ and a positive or negative integer, is homologous only
with such dyadics as are obtained by varying the values of $a$, $p$, and $q$.
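
The multiplication rule of No. 154 may be verified in the same way. The following is a minimal sketch under the assumption that dyads are stored as 3x3 matrices acting as prefactors; the function names are illustrative and do not come from the text.

```python
import math

def cross(u, v):
    return [u[1] * v[2] - u[2] * v[1],
            u[2] * v[0] - u[0] * v[2],
            u[0] * v[1] - u[1] * v[0]]

def dot(u, v):
    return sum(x * y for x, y in zip(u, v))

def reciprocals(a, b, c):
    """Reciprocal system of three non-complanar vectors."""
    vol = dot(a, cross(b, c))
    return tuple([x / vol for x in w]
                 for w in (cross(b, c), cross(c, a), cross(a, b)))

def dyad(u, v):
    return [[x * y for y in v] for x in u]

def combine(terms):
    return [[sum(s * m[i][j] for s, m in terms) for j in range(3)]
            for i in range(3)]

def matmul(p, q):
    return [[sum(p[i][k] * q[k][j] for k in range(3)) for j in range(3)]
            for i in range(3)]

def cyclotonic(a, p, q, basis):
    """a aa' + p cos q {bb' + cc'} + p sin q {cb' - bc'}."""
    al, be, ga = basis
    ar, br, gr = reciprocals(al, be, ga)
    return combine([
        (a, dyad(al, ar)),
        (p * math.cos(q), dyad(be, br)),
        (p * math.cos(q), dyad(ga, gr)),
        (p * math.sin(q), dyad(ga, br)),
        (-p * math.sin(q), dyad(be, gr)),
    ])

def close(m, n, tol=1e-9):
    return all(abs(m[i][j] - n[i][j]) < tol
               for i in range(3) for j in range(3))

AXES = ([1.0, 0.0, 0.0], [1.0, 2.0, 0.0], [0.5, -1.0, 3.0])
FIRST = cyclotonic(2.0, 1.5, 0.4, AXES)
SECOND = cyclotonic(-0.5, 3.0, 1.1, AXES)
# Coefficients a and p multiply, the angles q add.
EXPECTED = cyclotonic(2.0 * -0.5, 1.5 * 3.0, 0.4 + 1.1, AXES)
```

With these values `close(matmul(FIRST, SECOND), EXPECTED)` is true, which is the statement of the paragraph for one particular pair of cyclotonics.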

155. In general, any dyadic may be reduced to the form either of a
tonic or of a cyclotonic. (The exceptions are such as are made by
the limiting cases.) We may show this, and also indicate how the
reduction may be made, as follows. Let $\Phi$ be any dyadic. We have
first to show that there is at least one direction of $\rho$ for which

$$\Phi\cdot\rho = a\rho.$$

This equation is equivalent to

$$\Phi\cdot\rho - a\rho = 0,$$

or, $$\{\Phi - aI\}\cdot\rho = 0.$$

That is, $\Phi - aI$ is a planar dyadic, which may be expressed by the
equation $$|\Phi - aI| = 0.$$

(See No. 140.) Let $$\Phi = \lambda i + \mu j + \nu k;$$

the equation becomes

$$|[\lambda - ai]\,i + [\mu - aj]\,j + [\nu - ak]\,k| = 0,$$

or, $$[\lambda - ai]\times[\mu - aj]\cdot[\nu - ak] = 0,$$

or, $$a^3 - (i\cdot\lambda + j\cdot\mu + k\cdot\nu)\,a^2 + (i\cdot\mu\times\nu + j\cdot\nu\times\lambda + k\cdot\lambda\times\mu)\,a - \lambda\times\mu\cdot\nu = 0.$$

This may be written

$$a^3 - \Phi_S\,a^2 + \{\Phi^{-1}\}_S\,|\Phi|\,a - |\Phi| = 0.*$$

Now if the dyadic $\Phi$ is given in any form, the scalars

$$\Phi_S, \qquad \{\Phi^{-1}\}_S\,|\Phi|, \qquad |\Phi|$$

are easily determined. We have therefore a cubic equation in $a$, for
which we can find at least one and perhaps three roots. That is, we
can find at least one value of $a$, and perhaps three, which will satisfy
the equation $$|\Phi - aI| = 0.$$

By substitution of such a value, $\Phi - aI$ becomes a planar dyadic, the
planes of which may be easily determined.† Let $\alpha$ be a vector
normal to the plane of the consequents. Then

$$\{\Phi - aI\}\cdot\alpha = 0,$$

$$\Phi\cdot\alpha = a\alpha.$$
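
As an illustration of the cubic just obtained, the sketch below computes its three coefficients for a dyadic given by its nine components and evaluates the cubic at trial values of a. It assumes the matrix representation of a dyadic and uses the known identity that the middle coefficient is the sum of the principal two-rowed minors; the names `invariants` and `cubic` are hypothetical.

```python
def invariants(m):
    """Return the three coefficients of the cubic for a 3x3 matrix m:
    the trace, the sum of the principal 2x2 minors, and the determinant."""
    trace = m[0][0] + m[1][1] + m[2][2]
    minors = (m[0][0] * m[1][1] - m[0][1] * m[1][0]
              + m[0][0] * m[2][2] - m[0][2] * m[2][0]
              + m[1][1] * m[2][2] - m[1][2] * m[2][1])
    det = (m[0][0] * (m[1][1] * m[2][2] - m[1][2] * m[2][1])
           - m[0][1] * (m[1][0] * m[2][2] - m[1][2] * m[2][0])
           + m[0][2] * (m[1][0] * m[2][1] - m[1][1] * m[2][0]))
    return trace, minors, det

def cubic(m, a):
    """Value of a^3 - (trace) a^2 + (minors) a - (determinant)."""
    trace, minors, det = invariants(m)
    return a ** 3 - trace * a ** 2 + minors * a - det

# A triangular example, chosen so that the roots 2, -3, 0.5 can be read
# off the diagonal.
PHI = [[2.0, 1.0, 0.0],
       [0.0, -3.0, 4.0],
       [0.0, 0.0, 0.5]]
```

For this example `cubic(PHI, 2.0)`, `cubic(PHI, -3.0)` and `cubic(PHI, 0.5)` all vanish, while a value such as `cubic(PHI, 1.0)` does not, so the three diagonal entries are exactly the values of a for which the dyadic minus a times the idemfactor is planar.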

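If $\Phi$ is a tonic, we may obtain three equations of this kind, say

$$\Phi\cdot\alpha = a\alpha, \qquad \Phi\cdot\beta = b\beta, \qquad \Phi\cdot\gamma = c\gamma,$$

in which $\alpha$, $\beta$, $\gamma$ are not complanar. Hence (by No. 108),

$$\Phi = a\,\alpha\alpha' + b\,\beta\beta' + c\,\gamma\gamma',$$

where $\alpha'$, $\beta'$, $\gamma'$ are the reciprocals of $\alpha$, $\beta$, $\gamma$.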

In any case, we may suppose $a$ to have the same sign as $|\Phi|$, since
the cubic equation must have such a root. Let $\alpha$ (as before) be
normal to the plane of the consequents of the planar $\Phi - aI$, and $\alpha'$
normal to the plane of the antecedents, the lengths of $\alpha$ and $\alpha'$ being
such that $\alpha\cdot\alpha' = 1$.‡ Let $\beta$ be any vector normal to $\alpha'$, and such
that $\Phi\cdot\beta$ is not parallel to $\beta$. (The case in which $\Phi\cdot\beta$ is
always parallel to $\beta$, if $\beta$ is perpendicular to $\alpha'$, is evidently
that of a tonic, and needs no farther discussion.) $\{\Phi - aI\}\cdot\beta$ and
therefore $\Phi\cdot\beta$ will be perpendicular to $\alpha'$. The same will be true
of $\Phi^2\cdot\beta$. Now (by No. 140)

$$[\Phi\cdot\alpha]\cdot[\Phi^2\cdot\beta]\times[\Phi\cdot\beta] = |\Phi|\,\alpha\cdot[\Phi\cdot\beta]\times\beta,$$

that is, $$a\,\alpha\cdot[\Phi^2\cdot\beta]\times[\Phi\cdot\beta] = |\Phi|\,\alpha\cdot[\Phi\cdot\beta]\times\beta.$$


* [See note on p. 90.]

† In particular cases, $\Phi - aI$ may reduce to a linear dyadic, or to zero. These,
however, will present no difficulties to the student.

‡ For the case in which the two planes are perpendicular to each other, see No. 157.
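Hence, since $[\Phi^2\cdot\beta]\times[\Phi\cdot\beta]$ and $[\Phi\cdot\beta]\times\beta$ are parallel,

$$a\,[\Phi^2\cdot\beta]\times[\Phi\cdot\beta] = |\Phi|\,[\Phi\cdot\beta]\times\beta.$$

Since $a^{-1}|\Phi|$ is positive, we may set

$$p^2 = a^{-1}|\Phi|.$$

If we also set

$$\beta_1 = p^{-1}\Phi\cdot\beta, \qquad \beta_2 = p^{-2}\Phi^2\cdot\beta, \qquad \text{etc.},$$

$$\beta_{-1} = p\,\Phi^{-1}\cdot\beta, \qquad \beta_{-2} = p^2\,\Phi^{-2}\cdot\beta, \qquad \text{etc.},$$

the vectors $\beta$, $\beta_1$, $\beta_2$, etc., $\beta_{-1}$, $\beta_{-2}$, etc., will all lie in
the plane perpendicular to $\alpha'$, and we shall have

$$\beta_2\times\beta_1 = \beta_1\times\beta,$$

$$[\beta_2 + \beta]\times\beta_1 = 0.$$

We may therefore set $$\beta_2 + \beta = 2n\beta_1.$$

Multiplying by $p^{-1}\Phi$, and by $p\,\Phi^{-1}$,

$$\beta_3 + \beta_1 = 2n\beta_2, \qquad \beta_4 + \beta_2 = 2n\beta_3, \qquad \text{etc.},$$

$$\beta_1 + \beta_{-1} = 2n\beta, \qquad \beta + \beta_{-2} = 2n\beta_{-1}, \qquad \text{etc.}$$

Now, if $n > 1$, and we lay off* from a common origin the vectors

$$\beta,\ \beta_1,\ \beta_2,\ \text{etc.},\ \beta_{-1},\ \beta_{-2},\ \text{etc.},$$

the broken line joining the termini of these vectors will be convex
toward the origin. All these vectors must therefore lie between two
limiting lines, which may be drawn from the origin, and which may
be described as having the directions of $\beta_{\infty}$ and $\beta_{-\infty}$. A vector
having either of these directions is unaffected in direction by
multiplication by $\Phi$. In this case, therefore, $\Phi$ is a tonic. If $n < -1$
we may obtain the same result by considering the vectors

$$\beta,\ -\beta_1,\ \beta_2,\ -\beta_3,\ \beta_4,\ \text{etc.},\ -\beta_{-1},\ \beta_{-2},\ -\beta_{-3},\ \text{etc.},$$

except that a vector in the limiting directions will be reversed in
direction by multiplication by $\Phi$, which implies that the two
corresponding coefficients of the tonic are negative.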

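If $1 > n > -1$,† we may set

$$n = \cos q.$$

Then $$\beta_{-1} + \beta_1 = 2\cos q\,\beta.$$

Let us now determine $\gamma$ by the equation

$$\beta_1 = \cos q\,\beta + \sin q\,\gamma.$$

This gives $$\beta_{-1} = \cos q\,\beta - \sin q\,\gamma.$$

Now $\alpha'$ is one of the reciprocals of $\alpha$, $\beta$, and $\gamma$. Let $\beta'$ and $\gamma'$
be the others. If we set

$$\Psi = \cos q\,\{\beta\beta' + \gamma\gamma'\} + \sin q\,\{\gamma\beta' - \beta\gamma'\},$$

we have $$\Psi\cdot\alpha = 0, \qquad \Psi\cdot\beta = \beta_1, \qquad \Psi\cdot\beta_{-1} = \beta.$$


* The termini of the vectors will in fact lie on a hyperbola.

† For the limiting cases, in which $n = 1$, or $n = -1$, see No. 156.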
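Therefore, since

$$\{a\,\alpha\alpha' + p\Psi\}\cdot\alpha = a\alpha = \Phi\cdot\alpha,$$

$$\{a\,\alpha\alpha' + p\Psi\}\cdot\beta = p\beta_1 = \Phi\cdot\beta,$$

$$\{a\,\alpha\alpha' + p\Psi\}\cdot\beta_{-1} = p\beta = \Phi\cdot\beta_{-1},$$

it follows (by No. 108) that

$$\Phi = a\,\alpha\alpha' + p\Psi = a\,\alpha\alpha' + p\cos q\,\{\beta\beta' + \gamma\gamma'\} + p\sin q\,\{\gamma\beta' - \beta\gamma'\}.$$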

156. It will be sufficient to indicate (without demonstration) the
forms of dyadics which belong to the particular cases which have
been passed over in the preceding paragraph, so far as they present
any notable peculiarities.

If $n = \pm 1$ (page 72), the dyadic may be reduced to the form

$$a\,\alpha\alpha' + b\{\beta\beta' + \gamma\gamma'\} + bc\,\beta\gamma',$$

where $\alpha$, $\beta$, $\gamma$ are three non-complanar vectors, $\alpha'$, $\beta'$, $\gamma'$ their
reciprocals, and $a$, $b$, $c$ positive or negative scalars. The effect of
this as an operator will be evident if we resolve it into the three
homologous factors

$$a\,\alpha\alpha' + \beta\beta' + \gamma\gamma',$$

$$\alpha\alpha' + b\{\beta\beta' + \gamma\gamma'\},$$

$$\alpha\alpha' + \beta\beta' + \gamma\gamma' + c\,\beta\gamma'.$$

The displacement due to the last factor may be called a simple shear.
It consists (when the dyadic is used as prefactor) of a motion parallel
to $\beta$, and proportioned to the distance from the $\alpha$-$\beta$ plane. This
factor may be called a shearer.

This dyadic is homologous with such as are obtained by varying
the values of $a$, $b$, $c$, and only with such, when the values of $a$ and $b$
are different, and that of $c$ other than zero.

157. If the planar $\Phi - aI$ (page 71) has perpendicular planes, there
may be another value of $a$, of the same sign as $|\Phi|$, which will give a
planar which has not perpendicular planes. When this is not the
case, the dyadic may always be reduced to the form

$$a\{\alpha\alpha' + \beta\beta' + \gamma\gamma'\} + ab\{\alpha\beta' + \beta\gamma'\} + ac\,\alpha\gamma',$$

where $\alpha$, $\beta$, $\gamma$ are three non-complanar vectors, $\alpha'$, $\beta'$, $\gamma'$ their
reciprocals, and $a$, $b$, $c$ positive or negative scalars. This may be
resolved into the homologous factors

$$aI \quad\text{and}\quad I + b\{\alpha\beta' + \beta\gamma'\} + c\,\alpha\gamma'.$$

The displacement due to the last factor may be called a complex shear.
It consists (when the dyadic is used as prefactor) of a motion parallel
to $\alpha$ which is proportional to the distance from the $\alpha$-$\gamma$ plane,
together with a motion parallel to $b\beta + c\alpha$ which is proportional to
the distance from the $\alpha$-$\beta$ plane. This factor may be called a
complex shearer.

This dyadic is homologous with such as are obtained by varying
the values of $a$, $b$, $c$, and only such, unless $b = 0$.

It is always possible to take three mutually perpendicular vectors
for $\alpha$, $\beta$, and $\gamma$; or, if it be preferred, to take such values for these
vectors as shall make the term containing $c$ vanish.

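158. The dyadics described in the two last paragraphs may be
called shearing dyadics.

The criterion of a shearer is

$$\{\Phi - I\}^3 = 0, \qquad \Phi - I \neq 0.$$

The criterion of a simple shearer is

$$\{\Phi - I\}^2 = 0, \qquad \Phi - I \neq 0.$$

The criterion of a complex shearer is

$$\{\Phi - I\}^3 = 0, \qquad \{\Phi - I\}^2 \neq 0.$$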
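
The shearing factors of Nos. 156 and 157 can be examined numerically. The sketch below assumes, for simplicity only, that the three axes are the unit vectors i, j, k (so that each is its own reciprocal), and it stores a dyadic as the 3x3 matrix that acts as a prefactor. It shows that subtracting the idemfactor leaves a dyadic whose square vanishes for a simple shearer and whose cube, but not square, vanishes for a complex shearer. All names are illustrative.

```python
def matmul(p, q):
    return [[sum(p[i][k] * q[k][j] for k in range(3)) for j in range(3)]
            for i in range(3)]

def excess_power(phi, n):
    """Return {phi - I}^n for a 3x3 matrix phi and a whole number n >= 1."""
    excess = [[phi[i][j] - (1.0 if i == j else 0.0) for j in range(3)]
              for i in range(3)]
    result = excess
    for _ in range(n - 1):
        result = matmul(result, excess)
    return result

def is_zero(m, tol=1e-12):
    return all(abs(x) < tol for row in m for x in row)

def simple_shearer(c):
    """I + c jk on rectangular axes: motion along j proportional to k.r."""
    return [[1.0, 0.0, 0.0],
            [0.0, 1.0, c],
            [0.0, 0.0, 1.0]]

def complex_shearer(b, c):
    """I + b {ij + jk} + c ik on rectangular axes."""
    return [[1.0, b, c],
            [0.0, 1.0, b],
            [0.0, 0.0, 1.0]]

SIMPLE = simple_shearer(0.7)
COMPLEX = complex_shearer(2.0, -0.5)
```

For these examples `is_zero(excess_power(SIMPLE, 2))` and `is_zero(excess_power(COMPLEX, 3))` are both true, whereas `excess_power(COMPLEX, 2)` retains the single component b squared along ik.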


Note.—If a dyadic is a linear function of a vector $\rho$ (the term linear being used in
the same sense as in No. 105), we may represent the relation by an equation of the form

$$\Phi = \alpha\beta\gamma\cdot\rho + \epsilon\zeta\eta\cdot\rho + \text{etc.},$$

or $$\Phi = \{\alpha\beta\gamma + \epsilon\zeta\eta + \text{etc.}\}\cdot\rho,$$

where the expression in the braces may be called a triadic polynomial, and a single term
$\alpha\beta\gamma$ a triad, or the indeterminate product of the three vectors $\alpha$, $\beta$, $\gamma$. We are thus led
successively to the consideration of higher orders of indeterminate products of vectors,
triads, tetrads, etc., in general polyads, and of polynomials consisting of such terms,
triadics, tetradics, etc., in general polyadics. But the development of the subject in
this direction lies beyond our present purpose.

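It may sometimes be convenient to use notations like

$$\frac{\lambda,\ \mu,\ \nu}{|\,\alpha,\ \beta,\ \gamma} \quad\text{and}\quad \frac{\lambda,\ \mu,\ \nu}{\alpha,\ \beta,\ \gamma\,|}$$

to represent the conjugate dyadics which, the first as prefactor, and the second as
postfactor, change $\alpha$, $\beta$, $\gamma$ into $\lambda$, $\mu$, $\nu$, respectively. In the notations of the preceding
chapter these would be written

$$\lambda\alpha' + \mu\beta' + \nu\gamma' \quad\text{and}\quad \alpha'\lambda + \beta'\mu + \gamma'\nu$$

respectively, $\alpha'$, $\beta'$, $\gamma'$ denoting the reciprocals of $\alpha$, $\beta$, $\gamma$. If $\tau$ is a linear function
of $\rho$, the dyadics which as prefactor and postfactor change $\rho$ into $\tau$ may be written
respectively

$$\frac{\tau}{|\,\rho} \quad\text{and}\quad \frac{\tau}{\rho\,|}.$$

If $\tau$ is any function of $\rho$, the dyadics which as prefactor and postfactor change $d\rho$ into
$d\tau$ may be written respectively

$$\frac{d\tau}{|\,d\rho} \quad\text{and}\quad \frac{d\tau}{d\rho\,|}.$$

In the notation of the following chapter the second of these (when $\rho$ denotes a
position-vector) would be written $\nabla\tau$. The triadic which as prefactor changes $d\rho$ into
$\dfrac{d\tau}{|\,d\rho}$ may be written $\dfrac{d^2\tau}{|\,d\rho^2}$, and that which as postfactor changes $d\rho$ into $\dfrac{d\tau}{d\rho\,|}$ may be
written $\dfrac{d^2\tau}{d\rho^2\,|}$. The latter would be written $\nabla\nabla\tau$ in the notations of the following chapter.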
CHAPTER IV.


(Supplementary to Chapter II.) 

CONCERNING THE DIFFERENTIAL AND INTEGRAL CALCULUS OF VECTORS. 


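159. If $\omega$ is a vector having continuously varying values in space,
and $\rho$ the vector determining the position of a point, we may set

$$\rho = x\,i + y\,j + z\,k,$$

$$d\rho = dx\,i + dy\,j + dz\,k,$$

and regard $\omega$ as a function of $\rho$, or of $x$, $y$, and $z$. Then,

$$d\omega = dx\,\frac{d\omega}{dx} + dy\,\frac{d\omega}{dy} + dz\,\frac{d\omega}{dz},$$

that is,

$$d\omega = d\rho\cdot\Big\{ i\,\frac{d\omega}{dx} + j\,\frac{d\omega}{dy} + k\,\frac{d\omega}{dz} \Big\}.$$

If we set

$$\nabla\omega = i\,\frac{d\omega}{dx} + j\,\frac{d\omega}{dy} + k\,\frac{d\omega}{dz},$$

$$d\omega = d\rho\cdot\nabla\omega.$$

Here $\nabla$ stands for

$$i\,\frac{d}{dx} + j\,\frac{d}{dy} + k\,\frac{d}{dz}$$

exactly as in No. 52, except that it is here applied to a vector and
produces a dyadic, while in the former case it was applied to a
scalar and produced a vector. The dyadic $\nabla\omega$ represents the nine
differential coefficients of the three components of $\omega$ with respect to
$x$, $y$, and $z$, just as the vector $\nabla u$ (where $u$ is a scalar function of $\rho$)
represents the three differential coefficients of the scalar $u$ with
respect to $x$, $y$, and $z$.

It is evident that the expressions $\nabla\cdot\omega$ and $\nabla\times\omega$ already defined
(No. 54) are equivalent to $\{\nabla\omega\}_S$ and $\{\nabla\omega\}_\times$.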

160. An important case is that in which the vector operated on is
of the form $\nabla u$. We have then

$$d\nabla u = d\rho\cdot\nabla\nabla u,$$

where

$$\nabla\nabla u = \frac{d^2u}{dx^2}\,ii + \frac{d^2u}{dx\,dy}\,ij + \frac{d^2u}{dx\,dz}\,ik$$
$$\qquad + \frac{d^2u}{dy\,dx}\,ji + \frac{d^2u}{dy^2}\,jj + \frac{d^2u}{dy\,dz}\,jk$$
$$\qquad + \frac{d^2u}{dz\,dx}\,ki + \frac{d^2u}{dz\,dy}\,kj + \frac{d^2u}{dz^2}\,kk.$$

This dyadic, which is evidently self-conjugate, represents the six
differential coefficients of the second order of $u$ with respect to $x$, $y$,
and $z$.*

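161. The operators $\nabla\times$ and $\nabla\cdot$ may be applied to dyadics in a
manner entirely analogous to their use with scalars. Thus we may
define $\nabla\times\Phi$ and $\nabla\cdot\Phi$ by the equations

$$\nabla\times\Phi = i\times\frac{d\Phi}{dx} + j\times\frac{d\Phi}{dy} + k\times\frac{d\Phi}{dz},$$

$$\nabla\cdot\Phi = i\cdot\frac{d\Phi}{dx} + j\cdot\frac{d\Phi}{dy} + k\cdot\frac{d\Phi}{dz}.$$

Then, if $$\Phi = \alpha i + \beta j + \gamma k,$$

$$\nabla\times\Phi = \nabla\times\alpha\,i + \nabla\times\beta\,j + \nabla\times\gamma\,k,$$

$$\nabla\cdot\Phi = \nabla\cdot\alpha\,i + \nabla\cdot\beta\,j + \nabla\cdot\gamma\,k.$$

Or, if $$\Phi = i\alpha + j\beta + k\gamma,$$

$$\nabla\times\Phi = i\Big[\frac{d\gamma}{dy} - \frac{d\beta}{dz}\Big] + j\Big[\frac{d\alpha}{dz} - \frac{d\gamma}{dx}\Big] + k\Big[\frac{d\beta}{dx} - \frac{d\alpha}{dy}\Big],$$

$$\nabla\cdot\Phi = \frac{d\alpha}{dx} + \frac{d\beta}{dy} + \frac{d\gamma}{dz}.$$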


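162. We may now regard $\nabla\cdot\nabla$ in expressions like $\nabla\cdot\nabla\omega$ as
representing two successive operations, the result of which will be

$$\frac{d^2\omega}{dx^2} + \frac{d^2\omega}{dy^2} + \frac{d^2\omega}{dz^2},$$

in accordance with the definition of No. 70. We may also write
$\nabla\cdot\nabla\Phi$ for

$$\frac{d^2\Phi}{dx^2} + \frac{d^2\Phi}{dy^2} + \frac{d^2\Phi}{dz^2},$$

although in this case we cannot regard $\nabla\cdot\nabla$ as representing two
successive operations until we have defined $\nabla\Phi$.†

That $\nabla\cdot\nabla\Phi = \nabla\nabla\cdot\Phi - \nabla\times\nabla\times\Phi$ will be evident if we suppose $\Phi$ to
be expressed in the form $\alpha i + \beta j + \gamma k$. (See No. 71.)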

163. We have already seen that

$$u'' - u' = \int d\rho\cdot\nabla u,$$

where $u'$ and $u''$ denote the values of $u$ at the beginning and the end
of the line to which the integral relates. The same relation will hold
for a vector; i.e.,

$$\omega'' - \omega' = \int d\rho\cdot\nabla\omega.$$


* We might proceed to higher steps in differentiation by means of the triadics $\nabla\nabla\omega$,
$\nabla\nabla\nabla u$, the tetradics $\nabla\nabla\nabla\omega$, $\nabla\nabla\nabla\nabla u$, etc. See note on page 74. In like manner a dyadic
function of position in space ($\Phi$) might be differentiated by means of the triadic $\nabla\Phi$, the
tetradic $\nabla\nabla\Phi$, etc.

† See footnote to No. 160.
164. The following equations between surface-integrals for a closed
surface and volume-integrals for the space enclosed seem worthy of
mention. One or two have already been given, and are here repeated
for the sake of comparison.

$$\iint d\sigma\,u = \iiint dv\,\nabla u, \qquad (1)$$

$$\iint d\sigma\,\omega = \iiint dv\,\nabla\omega, \qquad (2)$$

$$\iint d\sigma\cdot\omega = \iiint dv\,\nabla\cdot\omega, \qquad (3)$$

$$\iint d\sigma\cdot\Phi = \iiint dv\,\nabla\cdot\Phi, \qquad (4)$$

$$\iint d\sigma\times\omega = \iiint dv\,\nabla\times\omega, \qquad (5)$$

$$\iint d\sigma\times\Phi = \iiint dv\,\nabla\times\Phi. \qquad (6)$$

It may aid the memory of the student to observe that the transformation
may be effected in each case by substituting $\iiint dv\,\nabla$ for $\iint d\sigma$.

165. The following equations between line-integrals for a closed
line and surface-integrals for any surface bounded by the line, may
also be mentioned. (One of these has already been given. See
No. 60.)

$$\int d\rho\,u = \iint d\sigma\times\nabla u, \qquad (1)$$

$$\int d\rho\,\omega = \iint d\sigma\times\nabla\omega, \qquad (2)$$

$$\int d\rho\cdot\omega = \iint d\sigma\cdot\nabla\times\omega, \qquad (3)$$

$$\int d\rho\cdot\Phi = \iint d\sigma\cdot\nabla\times\Phi, \qquad (4)$$

$$\int d\rho\times\omega = \iint \nabla\omega\cdot d\sigma - \iint d\sigma\,\nabla\cdot\omega. \qquad (5)$$

These transformations may be effected by substituting $\iint [d\sigma\times\nabla]$ for
$\int d\rho$. The brackets are here introduced to indicate that the
multiplication of $d\sigma$ with the $i$, $j$, $k$ implied in $\nabla$ is to be performed before
any other multiplication which may be required by a subsequent sign.
(This notation is not recommended for ordinary use, but only
suggested as a mnemonic artifice.)

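166. To the equations in No. 65 may be added many others, as,

$$\nabla[u\omega] = \nabla u\,\omega + u\nabla\omega, \qquad (1)$$

$$\nabla[\tau\times\omega] = \nabla\tau\times\omega - \nabla\omega\times\tau, \qquad (2)$$

$$\nabla\times[\tau\times\omega] = \omega\cdot\nabla\tau - \nabla\cdot\tau\,\omega - \tau\cdot\nabla\omega + \nabla\cdot\omega\,\tau, \qquad (3)$$

$$\nabla(\tau\cdot\omega) = \nabla\tau\cdot\omega + \nabla\omega\cdot\tau, \qquad (4)$$

$$\nabla\cdot\{\tau\omega\} = \nabla\cdot\tau\,\omega + \tau\cdot\nabla\omega, \qquad (5)$$

$$\nabla\times\{\tau\omega\} = \nabla\times\tau\,\omega - \tau\times\nabla\omega, \qquad (6)$$

$$\nabla\cdot\{u\Phi\} = \nabla u\cdot\Phi + u\nabla\cdot\Phi, \qquad (7)$$

etc.

The principle in all these cases is that if we have one of the operators
$\nabla$, $\nabla\cdot$, $\nabla\times$ prefixed to a product of any kind, and we make any
transformation of the expression which would be allowable if the $\nabla$
were a vector (viz., by changes in the order of the factors, in the
signs of multiplication, in the parentheses written or implied, etc.), by
which changes the $\nabla$ is brought into connection with one particular
factor, the expression thus transformed will represent the part of
the value of the original expression which results from the variation
of that factor.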

167. From the relations indicated in the last four paragraphs, may
be obtained directly a great number of transformations of definite
integrals similar to those given in Nos. 74-77, and corresponding to
those known in the scalar calculus by the name of integration by
parts.

168. The student will now find no difficulty in generalizing the
integrations of differential equations given in Nos. 78-89 by applying
to vectors those which relate to scalars, and to dyadics those which
relate to vectors.

169. The propositions in No. 90 relating to minimum values of the
volume-integral $\iiint u\,\omega\cdot\omega\,dv$ may be generalized by substituting
$\omega\cdot\Phi\cdot\omega$ for $u\,\omega\cdot\omega$, $\Phi$ being a given dyadic function of position in space.

170. The theory of the integrals which have been called potentials, 

Newtonians, etc. (see Nos. 91-102) may be extended to cases in which 
the operand is a vector instead of a scalar or a dyadic instead of a 
vector. So far as the demonstrations are concerned, the case of a 
vector may be reduced to that of a scalar by considering separately 
its three components, and the case of a dyadic may be reduced to that 
of a vector, by supposing the dyadic expressed in the form -I- xi + 

and considering each of these terms separately. 


CHAPTER V. 

CONCERNING TRANSCENDENTAL FUNCTIONS OF DYADICS. 

171. Def.—The exponential function, the sine and the cosine of a
dyadic may be defined by infinite series, exactly as the corresponding
functions in scalar analysis, viz.,

$$e^{\Phi} = I + \Phi + \frac{1}{2}\Phi^2 + \frac{1}{2\cdot 3}\Phi^3 + \text{etc.},$$

$$\sin\Phi = \Phi - \frac{1}{2\cdot 3}\Phi^3 + \frac{1}{2\cdot 3\cdot 4\cdot 5}\Phi^5 - \text{etc.},$$

$$\cos\Phi = I - \frac{1}{2}\Phi^2 + \frac{1}{2\cdot 3\cdot 4}\Phi^4 - \text{etc.}$$

These series are always convergent. For every value of $\Phi$ there is
one and only one value of each of these functions. The exponential
function may also be defined as the limit of the expression

$$\{I + N^{-1}\Phi\}^N,$$

when $N$, which is a whole number, is increased indefinitely. That
this definition is equivalent to the preceding, will appear if the
expression is expanded by the binomial theorem, which is evidently
applicable in a case of this kind.

These functions of $\Phi$ are homologous with $\Phi$.
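
The agreement of the two definitions of the exponential can be observed numerically. The sketch below is an illustration only: it assumes a dyadic is stored as a 3x3 matrix, sums a finite number of terms of the series, and takes N to be a power of two so that the N-th power is obtained by repeated squaring. The function names are hypothetical.

```python
import math

def matmul(p, q):
    return [[sum(p[i][k] * q[k][j] for k in range(3)) for j in range(3)]
            for i in range(3)]

def exp_series(m, terms=60):
    """Partial sum I + m + m^2/2! + ... with the given number of terms."""
    result = [[1.0 if i == j else 0.0 for j in range(3)] for i in range(3)]
    term = [row[:] for row in result]
    for n in range(1, terms):
        term = [[x / n for x in row] for row in matmul(term, m)]
        result = [[result[i][j] + term[i][j] for j in range(3)]
                  for i in range(3)]
    return result

def exp_limit(m, doublings=20):
    """{I + m/N}^N with N = 2**doublings, by repeated squaring."""
    n = 2 ** doublings
    power = [[(1.0 if i == j else 0.0) + m[i][j] / n for j in range(3)]
             for i in range(3)]
    for _ in range(doublings):
        power = matmul(power, power)
    return power

def max_difference(p, q):
    return max(abs(p[i][j] - q[i][j]) for i in range(3) for j in range(3))

# Example dyadic: a quarter-plane generator plus a stretch along k.
PHI = [[0.0, 1.0, 0.0],
       [-1.0, 0.0, 0.0],
       [0.0, 0.0, 0.5]]
BY_SERIES = exp_series(PHI)
BY_LIMIT = exp_limit(PHI)
```

The two results differ only by a quantity of the order of 1/N, and for this example the series reproduces cos 1, sin 1 and the square root of e in the expected places.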

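172. We may define the logarithm as the function which is the
inverse of the exponential, so that the equations

$$e^{\Psi} = \Phi,$$

$$\Psi = \log\Phi$$

are equivalent, leaving it undetermined for the present whether
every dyadic has a logarithm, and whether a dyadic can have more
than one.

173. It follows at once from the second definition of the exponential
function that, if $\Phi$ and $\Psi$ are homologous,

$$e^{\Phi + \Psi} = e^{\Phi}\cdot e^{\Psi},$$

and that, if $T$ is a positive or negative whole number,

$$\{e^{\Phi}\}^{T} = e^{T\Phi}.$$

174. If $\Xi$ and $\Phi$ are homologous dyadics, and such that

$$\Xi\cdot\Xi = -I,$$

the definitions of No. 171 give immediately

$$e^{\Xi\cdot\Phi} = \cos\Phi + \Xi\cdot\sin\Phi,$$

$$e^{-\Xi\cdot\Phi} = \cos\Phi - \Xi\cdot\sin\Phi,$$

whence

$$\cos\Phi = \tfrac{1}{2}\{e^{\Xi\cdot\Phi} + e^{-\Xi\cdot\Phi}\},$$

$$\sin\Phi = -\tfrac{1}{2}\,\Xi\cdot\{e^{\Xi\cdot\Phi} - e^{-\Xi\cdot\Phi}\}.$$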

175. If $\Phi\cdot\Psi = \Psi\cdot\Phi = 0$,

$$\{\Phi + \Psi\}^2 = \Phi^2 + \Psi^2, \qquad \{\Phi + \Psi\}^3 = \Phi^3 + \Psi^3, \qquad \text{etc.}$$

Therefore

$$e^{\Phi + \Psi} = e^{\Phi} + e^{\Psi} - I,$$

$$\cos\{\Phi + \Psi\} = \cos\Phi + \cos\Psi - I,$$

$$\sin\{\Phi + \Psi\} = \sin\Phi + \sin\Psi.$$

176. $$|e^{\Phi}| = e^{\Phi_S}.$$

For the first member of this equation is the limit of

$$|\{I + N^{-1}\Phi\}^N|,$$

that is, of $$|I + N^{-1}\Phi|^N.$$

If we set $\Phi = \alpha i + \beta j + \gamma k$, the limit becomes that of

$$(1 + N^{-1}\alpha\cdot i + N^{-1}\beta\cdot j + N^{-1}\gamma\cdot k)^N \quad\text{or}\quad (1 + N^{-1}\Phi_S)^N,$$

the limit of which is the second member of the equation to be proved.

177. By the definition of exponentials, the expression

$$e^{q\{kj - jk\}}$$

represents the limit of

$$\{I + qN^{-1}\{kj - jk\}\}^N.$$

Now $I + qN^{-1}\{kj - jk\}$ evidently represents a versor having the axis $i$
and the infinitesimal angle of version $qN^{-1}$. Hence the above
exponential represents a versor having the same axis and the angle of
version $q$. If we set $qi = \omega$, the exponential may be written

$$e^{I\times\omega}.$$

Such an expression therefore represents a versor. The axis and
direction of rotation are determined by the direction of $\omega$, and the
angle of rotation is equal to the magnitude of $\omega$. The value of the
versor will not be affected by increasing or diminishing the magnitude
of $\omega$ by $2\pi$.
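
The statement that the exponential of a dyadic of this kind is a versor can be illustrated numerically. The sketch below assumes that the dyadic formed from the idemfactor and the vector w acts as a prefactor by sending r to the vector product of w and r, and it sums the exponential series directly. The names `cross_dyadic`, `versor` and `apply` are illustrative only.

```python
import math

def matmul(p, q):
    return [[sum(p[i][k] * q[k][j] for k in range(3)) for j in range(3)]
            for i in range(3)]

def exp_series(m, terms=80):
    """Partial sum of the exponential series for a 3x3 matrix."""
    result = [[1.0 if i == j else 0.0 for j in range(3)] for i in range(3)]
    term = [row[:] for row in result]
    for n in range(1, terms):
        term = [[x / n for x in row] for row in matmul(term, m)]
        result = [[result[i][j] + term[i][j] for j in range(3)]
                  for i in range(3)]
    return result

def cross_dyadic(w):
    """Matrix which, as a prefactor, sends r to the vector product w x r."""
    return [[0.0, -w[2], w[1]],
            [w[2], 0.0, -w[0]],
            [-w[1], w[0], 0.0]]

def versor(w):
    """Exponential of the dyadic built from w: a rotation about w by |w|."""
    return exp_series(cross_dyadic(w))

def apply(m, r):
    return [sum(m[i][j] * r[j] for j in range(3)) for i in range(3)]

def max_difference(p, q):
    return max(abs(p[i][j] - q[i][j]) for i in range(3) for j in range(3))

QUARTER_TURN = versor([0.0, 0.0, math.pi / 2])
SAME_TURN = versor([0.0, 0.0, math.pi / 2 + 2 * math.pi])
```

Applied to the unit vector i, `QUARTER_TURN` gives j, and `SAME_TURN` agrees with it to rounding error, in accordance with the remark about increasing the magnitude of the vector by a whole revolution.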

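178. If, as in No. 151,

$$\Phi = a\,\alpha\alpha' + b\,\beta\beta' + c\,\gamma\gamma',$$

the definitions of No. 171 give

$$e^{\Phi} = e^{a}\alpha\alpha' + e^{b}\beta\beta' + e^{c}\gamma\gamma',$$

$$\cos\Phi = \cos a\,\alpha\alpha' + \cos b\,\beta\beta' + \cos c\,\gamma\gamma',$$

$$\sin\Phi = \sin a\,\alpha\alpha' + \sin b\,\beta\beta' + \sin c\,\gamma\gamma'.$$

If $a$, $b$, $c$ are positive and unequal, we may add, by No. 172,

$$\log\Phi = \log a\,\alpha\alpha' + \log b\,\beta\beta' + \log c\,\gamma\gamma'.$$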

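179. If, as in No. 153,

$$\Phi = a\,\alpha\alpha' + b\{\beta\beta' + \gamma\gamma'\} + c\{\gamma\beta' - \beta\gamma'\}$$

$$= a\,\alpha\alpha' + p\cos q\,\{\beta\beta' + \gamma\gamma'\} + p\sin q\,\{\gamma\beta' - \beta\gamma'\},$$

we have by No. 173

$$e^{\Phi} = e^{a\alpha\alpha'}\cdot e^{b\{\beta\beta' + \gamma\gamma'\}}\cdot e^{c\{\gamma\beta' - \beta\gamma'\}}.$$

But

$$e^{a\alpha\alpha'} = e^{a}\alpha\alpha' + \beta\beta' + \gamma\gamma',$$

$$e^{b\{\beta\beta' + \gamma\gamma'\}} = \alpha\alpha' + e^{b}\{\beta\beta' + \gamma\gamma'\},$$

$$e^{c\{\gamma\beta' - \beta\gamma'\}} = \alpha\alpha' + \cos c\,\{\beta\beta' + \gamma\gamma'\} + \sin c\,\{\gamma\beta' - \beta\gamma'\}.$$

Therefore,

$$e^{\Phi} = e^{a}\alpha\alpha' + e^{b}\cos c\,\{\beta\beta' + \gamma\gamma'\} + e^{b}\sin c\,\{\gamma\beta' - \beta\gamma'\}.$$

Hence, if $a$ is positive,

$$\log\Phi = \log a\,\alpha\alpha' + \log p\,\{\beta\beta' + \gamma\gamma'\} + q\,\{\gamma\beta' - \beta\gamma'\}.$$

Since the value of $\Phi$ is not affected by increasing or diminishing $q$ by
$2\pi$, the function $\log\Phi$ is many-valued.

To find the value of $\cos\Phi$ and $\sin\Phi$, let us set

$$\Theta = b\{\beta\beta' + \gamma\gamma'\} + c\{\gamma\beta' - \beta\gamma'\},$$

$$\Xi = \gamma\beta' - \beta\gamma'.$$

Then, by No. 175,

$$\cos\Phi = \cos\{a\,\alpha\alpha'\} + \cos\Theta - I.$$

But

$$\cos\{a\,\alpha\alpha'\} - I = \cos a\,\alpha\alpha' - \alpha\alpha'.$$

Therefore,

$$\cos\Phi = \cos a\,\alpha\alpha' - \alpha\alpha' + \cos\Theta.$$

Now, by No. 174,

$$\cos\Theta = \tfrac{1}{2}\{e^{\Xi\cdot\Theta} + e^{-\Xi\cdot\Theta}\}.$$

Since

$$\Xi\cdot\Theta = -c\{\beta\beta' + \gamma\gamma'\} + b\{\gamma\beta' - \beta\gamma'\},$$

$$e^{\Xi\cdot\Theta} = \alpha\alpha' + e^{-c}\cos b\,\{\beta\beta' + \gamma\gamma'\} + e^{-c}\sin b\,\{\gamma\beta' - \beta\gamma'\},$$

$$e^{-\Xi\cdot\Theta} = \alpha\alpha' + e^{c}\cos b\,\{\beta\beta' + \gamma\gamma'\} - e^{c}\sin b\,\{\gamma\beta' - \beta\gamma'\}.$$

Therefore

$$\cos\Theta = \alpha\alpha' + \tfrac{1}{2}(e^{c} + e^{-c})\cos b\,\{\beta\beta' + \gamma\gamma'\} - \tfrac{1}{2}(e^{c} - e^{-c})\sin b\,\{\gamma\beta' - \beta\gamma'\},$$

and

$$\cos\Phi = \cos a\,\alpha\alpha' + \tfrac{1}{2}(e^{c} + e^{-c})\cos b\,\{\beta\beta' + \gamma\gamma'\} - \tfrac{1}{2}(e^{c} - e^{-c})\sin b\,\{\gamma\beta' - \beta\gamma'\}.$$

In like manner we find

$$\sin\Phi = \sin a\,\alpha\alpha' + \tfrac{1}{2}(e^{c} + e^{-c})\sin b\,\{\beta\beta' + \gamma\gamma'\} + \tfrac{1}{2}(e^{c} - e^{-c})\cos b\,\{\gamma\beta' - \beta\gamma'\}.$$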

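180. If $\alpha$, $\beta$, $\gamma$ and $\alpha'$, $\beta'$, $\gamma'$ are reciprocals, and

$$\Phi = a\,\alpha\alpha' + b\{\beta\beta' + \gamma\gamma'\} + c\,\beta\gamma',$$

and $N$ is any whole number,

$$\Phi^{N} = a^{N}\alpha\alpha' + b^{N}\{\beta\beta' + \gamma\gamma'\} + N b^{N-1} c\,\beta\gamma'.$$

Therefore,

$$e^{\Phi} = e^{a}\alpha\alpha' + e^{b}\{\beta\beta' + \gamma\gamma'\} + e^{b}c\,\beta\gamma',$$

$$\cos\Phi = \cos a\,\alpha\alpha' + \cos b\,\{\beta\beta' + \gamma\gamma'\} - c\sin b\,\beta\gamma',$$

$$\sin\Phi = \sin a\,\alpha\alpha' + \sin b\,\{\beta\beta' + \gamma\gamma'\} + c\cos b\,\beta\gamma'.$$

If $a$ and $b$ are unequal, and $c$ other than zero, we may add

$$\log\Phi = \log a\,\alpha\alpha' + \log b\,\{\beta\beta' + \gamma\gamma'\} + c\,b^{-1}\beta\gamma'.$$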

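181. If $\alpha$, $\beta$, $\gamma$, and $\alpha'$, $\beta'$, $\gamma'$ are reciprocals, and

$$\Phi = aI + b\{\alpha\beta' + \beta\gamma'\} + c\,\alpha\gamma',$$

and $N$ is a whole number,

$$\Phi^{N} = a^{N}I + N a^{N-1} b\{\alpha\beta' + \beta\gamma'\} + \big(N a^{N-1} c + \tfrac{1}{2}N(N-1)a^{N-2}b^2\big)\alpha\gamma'.$$

Therefore

$$e^{\Phi} = e^{a}I + e^{a}b\{\alpha\beta' + \beta\gamma'\} + e^{a}(\tfrac{1}{2}b^2 + c)\,\alpha\gamma',$$

$$\cos\Phi = \cos a\,I - b\sin a\,\{\alpha\beta' + \beta\gamma'\} - (\tfrac{1}{2}b^2\cos a + c\sin a)\,\alpha\gamma',$$

$$\sin\Phi = \sin a\,I + b\cos a\,\{\alpha\beta' + \beta\gamma'\} - (\tfrac{1}{2}b^2\sin a - c\cos a)\,\alpha\gamma'.$$

Unless $b = 0$, we may add

$$\log\Phi = \log a\,I + b\,a^{-1}\{\alpha\beta' + \beta\gamma'\} + (c\,a^{-1} - \tfrac{1}{2}b^2a^{-2})\,\alpha\gamma'.$$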
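
The closed forms of No. 181 can be compared with the defining series. The sketch below assumes, purely for convenience, that the three axes are the rectangular unit vectors i, j, k, so that the dyadic becomes an upper-triangular matrix, and that a is positive so that its logarithm is real. The function names are hypothetical.

```python
import math

def matmul(p, q):
    return [[sum(p[i][k] * q[k][j] for k in range(3)) for j in range(3)]
            for i in range(3)]

def exp_series(m, terms=60):
    """Partial sum of the exponential series for a 3x3 matrix."""
    result = [[1.0 if i == j else 0.0 for j in range(3)] for i in range(3)]
    term = [row[:] for row in result]
    for n in range(1, terms):
        term = [[x / n for x in row] for row in matmul(term, m)]
        result = [[result[i][j] + term[i][j] for j in range(3)]
                  for i in range(3)]
    return result

def shear_form(a, b, c):
    """a I + b {ij + jk} + c ik on rectangular axes."""
    return [[a, b, c],
            [0.0, a, b],
            [0.0, 0.0, a]]

def closed_exp(a, b, c):
    """Closed form of the exponential of shear_form(a, b, c)."""
    ea = math.exp(a)
    return shear_form(ea, ea * b, ea * (0.5 * b * b + c))

def closed_log(a, b, c):
    """Closed form of a logarithm of shear_form(a, b, c), for a > 0."""
    return shear_form(math.log(a), b / a, c / a - 0.5 * b * b / (a * a))

def max_difference(p, q):
    return max(abs(p[i][j] - q[i][j]) for i in range(3) for j in range(3))

A, B, C = 1.5, 2.0, -0.5
SERIES_EXP = exp_series(shear_form(A, B, C))
RECOVERED = exp_series(closed_log(A, B, C))
```

`SERIES_EXP` agrees with `closed_exp(A, B, C)`, and exponentiating `closed_log(A, B, C)` by the series returns the original dyadic, which is what the definition of the logarithm in No. 172 requires.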

182. If we suppose any dyadic $\Phi$ to vary, but with the limitation
that all its values are homologous, we may obtain from the definitions
of No. 171

$$d\{e^{\Phi}\} = e^{\Phi}\cdot d\Phi, \qquad (1)$$

$$d\sin\Phi = \cos\Phi\cdot d\Phi, \qquad (2)$$

$$d\cos\Phi = -\sin\Phi\cdot d\Phi, \qquad (3)$$

$$d\log\Phi = \Phi^{-1}\cdot d\Phi, \qquad (4)$$

as in the ordinary calculus, but we must not apply these equations to
cases in which the values of $\Phi$ are not homologous.

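183. If, however, $\Gamma$ is any constant dyadic, the variations of $t\Gamma$ will
necessarily be homologous with $t\Gamma$, and we may write without other
limitation than that $\Gamma$ is constant,

$$\frac{d\{e^{t\Gamma}\}}{dt} = \Gamma\cdot e^{t\Gamma}, \qquad (1)$$

$$\frac{d\sin\{t\Gamma\}}{dt} = \Gamma\cdot\cos\{t\Gamma\}, \qquad (2)$$

$$\frac{d\cos\{t\Gamma\}}{dt} = -\Gamma\cdot\sin\{t\Gamma\}, \qquad (3)$$

$$\frac{d\log\{t\Gamma\}}{dt} = \frac{I}{t}. \qquad (4)$$

A second differentiation gives

$$\frac{d^2\{e^{t\Gamma}\}}{dt^2} = \Gamma^2\cdot e^{t\Gamma}, \qquad (5)$$

$$\frac{d^2\sin\{t\Gamma\}}{dt^2} = -\Gamma^2\cdot\sin\{t\Gamma\}, \qquad (6)$$

$$\frac{d^2\cos\{t\Gamma\}}{dt^2} = -\Gamma^2\cdot\cos\{t\Gamma\}. \qquad (7)$$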


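184. It follows that if we have a differential equation of the form

$$\frac{d\rho}{dt} = \Gamma\cdot\rho,$$

the integral equation will be of the form

$$\rho = e^{t\Gamma}\cdot\rho',$$

$\rho'$ representing the value of $\rho$ for $t = 0$. For this gives

$$\frac{d\rho}{dt} = \Gamma\cdot e^{t\Gamma}\cdot\rho' = \Gamma\cdot\rho,$$

and the proper value of $\rho$ for $t = 0$.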
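
The integral of No. 184 may be tested by a difference quotient. The following sketch is an illustration under stated assumptions: the constant dyadic is stored as a 3x3 matrix, the exponential is summed from its series, and the time derivative is approximated by a central difference with a small step. The names `position` and `velocity_estimate` and the particular numbers are hypothetical.

```python
def matmul(p, q):
    return [[sum(p[i][k] * q[k][j] for k in range(3)) for j in range(3)]
            for i in range(3)]

def exp_series(m, terms=60):
    """Partial sum of the exponential series for a 3x3 matrix."""
    result = [[1.0 if i == j else 0.0 for j in range(3)] for i in range(3)]
    term = [row[:] for row in result]
    for n in range(1, terms):
        term = [[x / n for x in row] for row in matmul(term, m)]
        result = [[result[i][j] + term[i][j] for j in range(3)]
                  for i in range(3)]
    return result

def apply(m, r):
    return [sum(m[i][j] * r[j] for j in range(3)) for i in range(3)]

def position(gamma, rho0, t):
    """rho(t) = exp(t gamma) . rho0, the integral of d rho/dt = gamma . rho."""
    scaled = [[t * x for x in row] for row in gamma]
    return apply(exp_series(scaled), rho0)

def velocity_estimate(gamma, rho0, t, h=1e-5):
    """Central-difference approximation to d rho/dt at time t."""
    ahead = position(gamma, rho0, t + h)
    behind = position(gamma, rho0, t - h)
    return [(x - y) / (2 * h) for x, y in zip(ahead, behind)]

GAMMA = [[0.0, 1.0, 0.0],
         [-2.0, -0.3, 0.0],
         [0.0, 0.0, 0.1]]
RHO0 = [1.0, 0.0, 2.0]
T = 0.8
LEFT = velocity_estimate(GAMMA, RHO0, T)
RIGHT = apply(GAMMA, position(GAMMA, RHO0, T))
```

The two sides `LEFT` and `RIGHT` agree to within the error of the difference quotient, and `position(GAMMA, RHO0, 0.0)` returns the initial vector.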

185. Def.—A flux which is a linear function of the position-vector
is called a homogeneous-strain-flux from the nature of the strain
which it produces. Such a flux may evidently be represented by a
dyadic.

In the equations of the last paragraph, we may suppose $\rho$ to
represent a position-vector, $t$ the time, and $\Gamma$ a homogeneous-strain-flux.
Then $e^{t\Gamma}$ will represent the strain produced by the flux $\Gamma$ in
the time $t$.

In like manner, if $\Delta$ represents a homogeneous strain, $\{\log\Delta\}/t$
will represent a homogeneous-strain-flux which would produce the
strain $\Delta$ in the time $t$.
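186. If we have

$$\frac{d^2\rho}{dt^2} = \Gamma^2\cdot\rho,$$

where $\Gamma$ is complete, the integral equation will be of the form

$$\rho = e^{t\Gamma}\cdot\alpha + e^{-t\Gamma}\cdot\beta.$$

For this gives

$$\frac{d\rho}{dt} = \Gamma\cdot e^{t\Gamma}\cdot\alpha - \Gamma\cdot e^{-t\Gamma}\cdot\beta,$$

$$\frac{d^2\rho}{dt^2} = \Gamma^2\cdot e^{t\Gamma}\cdot\alpha + \Gamma^2\cdot e^{-t\Gamma}\cdot\beta = \Gamma^2\cdot\rho,$$

and $\alpha$ and $\beta$ may be determined so as to satisfy the equations

$$\rho_{t=0} = \alpha + \beta, \qquad \Big[\frac{d\rho}{dt}\Big]_{t=0} = \Gamma\cdot\{\alpha - \beta\}.$$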

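187. The differential equation

$$\frac{d^2\rho}{dt^2} = -\Gamma^2\cdot\rho$$

will be satisfied by

$$\rho = \cos\{t\Gamma\}\cdot\alpha + \sin\{t\Gamma\}\cdot\beta,$$

whence

$$\frac{d\rho}{dt} = -\Gamma\cdot\sin\{t\Gamma\}\cdot\alpha + \Gamma\cdot\cos\{t\Gamma\}\cdot\beta,$$

$$\frac{d^2\rho}{dt^2} = -\Gamma^2\cdot\cos\{t\Gamma\}\cdot\alpha - \Gamma^2\cdot\sin\{t\Gamma\}\cdot\beta = -\Gamma^2\cdot\rho.$$

If $\Gamma$ is complete, the constants $\alpha$ and $\beta$ may be determined to satisfy
the equations

$$\rho_{t=0} = \alpha, \qquad \Big[\frac{d\rho}{dt}\Big]_{t=0} = \Gamma\cdot\beta.$$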

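188. If

$$\frac{d^2\rho}{dt^2} = \{\Gamma^2 - \Delta^2\}\cdot\rho,$$

where $\Gamma^2 - \Delta^2$ is a complete dyadic, and

$$\Gamma\cdot\Delta = \Delta\cdot\Gamma = 0,$$

we may set

$$\rho = \{\tfrac{1}{2}e^{t\Gamma} + \tfrac{1}{2}e^{-t\Gamma} + \cos\{t\Delta\} - I\}\cdot\alpha + \{\tfrac{1}{2}e^{t\Gamma} - \tfrac{1}{2}e^{-t\Gamma} + \sin\{t\Delta\}\}\cdot\beta,$$

which gives

$$\frac{d\rho}{dt} = \{\tfrac{1}{2}\Gamma\cdot e^{t\Gamma} - \tfrac{1}{2}\Gamma\cdot e^{-t\Gamma} - \Delta\cdot\sin\{t\Delta\}\}\cdot\alpha$$
$$\qquad + \{\tfrac{1}{2}\Gamma\cdot e^{t\Gamma} + \tfrac{1}{2}\Gamma\cdot e^{-t\Gamma} + \Delta\cdot\cos\{t\Delta\}\}\cdot\beta,$$

$$\frac{d^2\rho}{dt^2} = \{\tfrac{1}{2}\Gamma^2\cdot e^{t\Gamma} + \tfrac{1}{2}\Gamma^2\cdot e^{-t\Gamma} - \Delta^2\cdot\cos\{t\Delta\}\}\cdot\alpha$$
$$\qquad + \{\tfrac{1}{2}\Gamma^2\cdot e^{t\Gamma} - \tfrac{1}{2}\Gamma^2\cdot e^{-t\Gamma} - \Delta^2\cdot\sin\{t\Delta\}\}\cdot\beta$$
$$\qquad = \{\Gamma^2 - \Delta^2\}\cdot\rho.$$

The constants $\alpha$ and $\beta$ are to be determined by

$$\rho_{t=0} = \alpha, \qquad \Big[\frac{d\rho}{dt}\Big]_{t=0} = \{\Gamma + \Delta\}\cdot\beta.$$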

189. It will appear, on reference to Nos. 155-157, that every
complete dyadic may be expressed in one of three forms, viz., as a
square, as a square with the negative sign, or as a difference of squares
of two dyadics of which both the direct products are equal to zero.
It follows that every equation of the form

$$\frac{d^2\rho}{dt^2} = \Theta\cdot\rho,$$

where $\Theta$ is any constant and complete dyadic, may be integrated by
the preceding formulæ.


NOTE ON BIVECTOR ANALYSIS.*

1. A vector is determined by three algebraic quantities. It often
occurs that the solution of the equations by which these are to be
determined gives imaginary values, i.e., instead of scalars we obtain
biscalars, or expressions of the form $a + \iota b$, where $a$ and $b$ are scalars,
and $\iota = \sqrt{-1}$. It is most simple, and always allowable, to consider
the vector as determined by its components parallel to a normal
system of axes. In other words, a vector may be represented in the
form

$$x\,i + y\,j + z\,k.$$

Now if the vector is required to satisfy certain conditions, the solution
of the equations which determine the values of $x$, $y$, and $z$, in the
most general case, will give results of the form

$$x = x_1 + \iota x_2,$$

$$y = y_1 + \iota y_2,$$

$$z = z_1 + \iota z_2,$$

* Thus far, in accordance with the purpose expressed in the footnote on page 17, we
have considered only real values of scalars and vectors. The object of this limitation
has been to present the subject in the most elementary manner. The limitation is
however often inconvenient, and does not allow the most symmetrical and complete
development of the subject in many important directions. Thus in Chapter V, and the
latter part of Chapter III, the exclusion of imaginary values has involved a considerable
sacrifice of simplicity both in the enunciation of theorems and in their demonstration.
The student will find an interesting and profitable exercise in working over this part of
the subject with the aid of imaginary values, especially in the discussion of the
imaginary roots of the cubic equation on page 71, and in the use of the formula

$$e^{\iota\Phi} = \cos\Phi + \iota\sin\Phi$$

in developing the properties of the sines, cosines, and exponentials of dyadics.
where $x_1$, $x_2$, $y_1$, $y_2$, $z_1$, $z_2$ are scalars. Substituting these values in

$$x\,i + y\,j + z\,k,$$

we obtain $$(x_1 + \iota x_2)\,i + (y_1 + \iota y_2)\,j + (z_1 + \iota z_2)\,k,$$

or, if we set $$\rho_1 = x_1 i + y_1 j + z_1 k,$$

$$\rho_2 = x_2 i + y_2 j + z_2 k,$$

we obtain $$\rho_1 + \iota\rho_2.$$

We shall call this a bivector, a term which will include a vector as a
particular case. When we wish to express a bivector by a single
letter, we shall use the small German letters. Thus we may write

$$\mathfrak{r} = \rho_1 + \iota\rho_2.$$

An important case is that in which $\rho_1$ and $\rho_2$ have the same direction.
The bivector may then be expressed in the form $(a + \iota b)\rho$, in which
the vector factor, if we choose, may be a unit vector. In this case, we
may say that the bivector has a real direction. In fact, if we express
the bivector in the form

$$(x_1 + \iota x_2)\,i + (y_1 + \iota y_2)\,j + (z_1 + \iota z_2)\,k,$$

the ratios of the coefficients of $i$, $j$, and $k$, which determine the
direction cosines of the vector, will in this case be real.

2. The consideration that operations upon bivectors may be regarded
as operations upon their biscalar $x$-, $y$- and $z$-components is sufficient
to show the possibility of a bivector analysis and to indicate what its
rules must be. But this point of view does not afford the most simple
conception of the operations which we have to perform upon bivectors.
It is desirable that the definitions of the fundamental operations should
be independent of such extraneous considerations as any system of
axes.

The various signs of our analysis, when applied to bivectors, may
therefore be defined as follows, viz.,

The bivector equation

$$\mu' + \iota\nu' = \mu'' + \iota\nu''$$

implies the two vector equations

$$\mu' = \mu'', \quad\text{and}\quad \nu' = \nu''.$$

$$-[\mu + \iota\nu] = -\mu + \iota[-\nu].$$

$$[\mu' + \iota\nu'] + [\mu'' + \iota\nu''] = [\mu' + \mu''] + \iota[\nu' + \nu''].*$$

$$[\mu' + \iota\nu']\cdot[\mu'' + \iota\nu''] = [\mu'\cdot\mu'' - \nu'\cdot\nu''] + \iota[\mu'\cdot\nu'' + \nu'\cdot\mu''].$$

$$[\mu' + \iota\nu']\times[\mu'' + \iota\nu''] = [\mu'\times\mu'' - \nu'\times\nu''] + \iota[\mu'\times\nu'' + \nu'\times\mu''].$$


* $(a + \iota b)[\mu + \iota\nu] = a\mu - b\nu + \iota[a\nu + b\mu].$

$[\mu + \iota\nu](a + \iota b) = \mu a - \nu b + \iota[\mu b + \nu a].$

Therefore the position of the scalar factor is indifferent. [MS. note by author.]





With these definitions, a great part of the laws of vector analysis 
may be applied at once to bivector expressions. But an equation 
which is impossible in vector analysis may be possible in bivector 
analysis, and in general the number of roots of an equation, or of the 
values of a function, will be different according as we recognize, 
or do not recognize, imaginary values. 

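3. Def.—Two bivectors, or two biscalars, are said to be conjugate,
when their real parts are the same, and their imaginary parts differ
in sign, and in sign only.

Hence, the product of the conjugates of any number of bivectors
and biscalars is the conjugate of the product of the bivectors and
biscalars. This is true of any kind of product.

The products of a bivector and its conjugate are as follows:

$$[\mu + \iota\nu]\cdot[\mu - \iota\nu] = \mu\cdot\mu + \nu\cdot\nu,$$

$$[\mu + \iota\nu]\times[\mu - \iota\nu] = 2\iota\,\nu\times\mu,$$

$$[\mu + \iota\nu][\mu - \iota\nu] = \{\mu\mu + \nu\nu\} + \iota\{\nu\mu - \mu\nu\}.$$

Hence, if $\mu$ and $\nu$ represent the real and imaginary parts of a
bivector, the values of

$$\mu\cdot\mu + \nu\cdot\nu, \qquad \mu\times\nu, \qquad \mu\mu + \nu\nu, \qquad \nu\mu - \mu\nu$$

are not affected by multiplying the bivector by a biscalar of the form
$a + \iota b$, in which $a^2 + b^2 = 1$, say a cyclic scalar. Thus, if we set

$$\mu' + \iota\nu' = (a + \iota b)[\mu + \iota\nu],$$

we shall have

$$\mu' - \iota\nu' = (a - \iota b)[\mu - \iota\nu],$$

and

$$[\mu' + \iota\nu']\cdot[\mu' - \iota\nu'] = [\mu + \iota\nu]\cdot[\mu - \iota\nu].$$

That is,

$$\mu'\cdot\mu' + \nu'\cdot\nu' = \mu\cdot\mu + \nu\cdot\nu,$$

and so in the other cases.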
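
The invariance just stated can be illustrated with complex arithmetic. The sketch below assumes a bivector is stored as a list of three complex components referred to rectangular axes, with the direct and skew products defined component-wise as in No. 2. The names `bdot`, `bcross`, `conjugate` and `times` are illustrative only.

```python
import cmath

def bdot(r, s):
    """Direct product of two bivectors (no conjugation is implied)."""
    return sum(x * y for x, y in zip(r, s))

def bcross(r, s):
    """Skew product of two bivectors."""
    return [r[1] * s[2] - r[2] * s[1],
            r[2] * s[0] - r[0] * s[2],
            r[0] * s[1] - r[1] * s[0]]

def conjugate(r):
    return [x.conjugate() for x in r]

def times(m, r):
    """Product of a biscalar m and a bivector r."""
    return [m * x for x in r]

# Real part mu = (1, 2, 0) and imaginary part nu = (0, 1, 3).
R = [complex(1, 0), complex(2, 1), complex(0, 3)]
CYCLIC = cmath.exp(0.7j)      # a + ib with a^2 + b^2 = 1
S = times(CYCLIC, R)

SQUARED_MODULUS = bdot(R, conjugate(R))   # mu.mu + nu.nu
SKEW = bcross(R, conjugate(R))            # 2 i (nu x mu)
```

For this example `SQUARED_MODULUS` equals 15, the sum of the squares of the two parts, `SKEW` is twice the imaginary unit times the skew product of the imaginary and real parts, and both quantities are unchanged when `R` is replaced by `S`.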

4. Def.—In biscalar analysis, the product of a biscalar and its
conjugate is a positive scalar. The positive square root of this scalar is
called the modulus of the biscalar. In bivector analysis, the direct
product of a bivector and its conjugate is, as seen above, a positive
scalar. The positive square root of this scalar may be called the
modulus of the bivector. When this modulus vanishes, the bivector
vanishes, and only in this case. If the bivector is multiplied by a
biscalar, its modulus is multiplied by the modulus of the biscalar.
The conjugate of a (real) vector is the vector itself, and the modulus
of the vector is the same as its magnitude.

5. Def.—If between two vectors, $\alpha$ and $\beta$, there subsists a relation
of the form

$$\alpha = n\beta,$$

where $n$ is a scalar, we say that the vectors are parallel. Analogy
leads us to call two bivectors parallel, when there subsists between
them a relation of the form

$$\mathfrak{a} = m\mathfrak{b},$$

where $m$ (in the most general case) is a biscalar.

To aid us in comprehending the geometrical signification of this 
relation, we may regard the biscalar as consisting of two factors, one 
of which is a positive scalar (the modulus of the biscalar), and the 
other may be put in the form $\cos q+\iota\sin q$. The effect of multiplying 
a bivector by a positive scalar is obvious. To understand the effect of 
a multiplier of the form $\cos q+\iota\sin q$ upon a bivector $\mu+\iota\nu$, let us set 

$$\mu'+\iota\nu' = (\cos q+\iota\sin q)[\mu+\iota\nu].$$

We have then 

$$\mu' = \cos q\,\mu-\sin q\,\nu,$$

$$\nu' = \cos q\,\nu+\sin q\,\mu.$$

Now if $\mu$ and $\nu$ are of the same magnitude and at right angles, the 
effect of the multiplication is evidently to rotate these vectors in 
their plane an angular distance $q$, which is to be measured in the 
direction from $\nu$ to $\mu$. In any case we may regard $\mu$ and $\nu$ as the 
projections (by parallel lines) of two perpendicular vectors of the same 
length. The two last equations show that $\mu'$ and $\nu'$ will be the 
projections of the vectors obtained by the rotation of these perpendicular 
vectors in their plane through the angle $q$. Hence, if we 
construct an ellipse of which $\mu$ and $\nu$ are conjugate semi-diameters, $\mu'$ 
and $\nu'$ will be another pair of conjugate semi-diameters, and the sectors 
between $\mu$ and $\mu'$ and between $\nu$ and $\nu'$, will each be to the whole area 
of the ellipse as $q$ to $2\pi$, the sector between $\nu$ and $\nu'$ lying on the same 
side of $\nu$ as $\mu$, and that between $\mu$ and $\mu'$ lying on the same side 
of $\mu$ as $-\nu$. 

It follows that any bivector $\mu+\iota\nu$ may be put in the form 

$$(\cos q+\iota\sin q)[\alpha+\iota\beta],$$

in which $\alpha$ and $\beta$ are at right angles, being the semi-axes of the ellipse 
of which $\mu$ and $\nu$ are conjugate semi-diameters. This ellipse we may 
call the directional ellipse of the bivector. In the case of a real 
vector, or of a vector having a real direction, it reduces to a straight 
line. In any other case, the angular direction from the imaginary to 
the real part of the bivector is to be regarded as positive in the ellipse, 
and the specification of the ellipse must be considered incomplete 
without the indication of this direction. 

Parallelism of bivectors, then, signifies the similarity and similar 
position of their directional ellipses. Similar position includes identity 
of the angular directions mentioned above. 
6. To reduce a given bivector $\mathfrak{r}$ to the above form, we may set 

$$\mathfrak{r}\cdot\mathfrak{r} = (\cos q+\iota\sin q)^2[\alpha+\iota\beta]\cdot[\alpha+\iota\beta]$$

$$= (\cos 2q+\iota\sin 2q)(\alpha\cdot\alpha-\beta\cdot\beta)$$

$$= a+\iota b,$$

where $a$ and $b$ are scalars, which we may regard as known. The 
value of $q$ may be determined by the equation 

$$\tan 2q = \frac{b}{a},$$

the quadrant to which $2q$ belongs being determined so as to give 
$\sin 2q$ and $\cos 2q$ the same signs as $b$ and $a$. Then $\alpha$ and $\beta$ will be 
given by the equation 

$$\alpha+\iota\beta = (\cos q-\iota\sin q)\,\mathfrak{r}.$$

The solution is indeterminate when the real and imaginary parts of 
the given bivector are perpendicular and equal in magnitude. In this 
case the directional ellipse is a circle, and the bivector may be called 
circular. The criterion of a circular bivector is 

$$\mathfrak{r}\cdot\mathfrak{r} = 0.$$

It is especially to be noticed that from this equation we cannot 
conclude that 

$$\mathfrak{r} = 0,$$

as in the analysis of real vectors. This may also be shown by expressing 
$\mathfrak{r}$ in the form $xi+yj+zk$, in which $x$, $y$, $z$ are biscalars. The 
equation then becomes 

$$x^2+y^2+z^2 = 0,$$

which evidently does not require $x$, $y$, and $z$ to vanish, as would be 
the case if only real values are considered. 
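
The reduction described in No. 6 may be sketched as a short routine. This is an illustration under our own assumptions rather than anything given in the text: a bivector is a list of three complex numbers, the quadrant rule for $2q$ is delegated to `math.atan2`, and the name `reduce_to_axes`, the tolerance and the sample values are hypothetical.

```python
import cmath
import math

def dot(u, v):
    """Direct product of two 3-component sequences (no complex conjugation)."""
    return sum(a * b for a, b in zip(u, v))

def reduce_to_axes(r, tol=1e-12):
    """Return (q, alpha, beta) with r = (cos q + i sin q)(alpha + i beta), alpha.beta = 0."""
    rr = dot(r, r)                               # the biscalar a + i*b
    if abs(rr) < tol:
        raise ValueError("circular bivector: q is indeterminate")
    # atan2 places 2q in the quadrant fixed by the signs of b and a.
    q = 0.5 * math.atan2(rr.imag, rr.real)
    reduced = [cmath.exp(-1j * q) * z for z in r]   # (cos q - i sin q) r
    alpha = [z.real for z in reduced]
    beta = [z.imag for z in reduced]
    return q, alpha, beta

r = [1 + 2j, -3 + 0.5j, 0.25 - 1j]               # arbitrary sample bivector
q, alpha, beta = reduce_to_axes(r)
print(abs(dot(alpha, beta)) < 1e-12)             # the semi-axes are at right angles
```

Calling `reduce_to_axes([1, 1j, 0])` raises `ValueError`, since that bivector satisfies the criterion of a circular bivector without being zero.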

7. Def. We call two vectors $\rho$ and $\sigma$ perpendicular when $\rho\cdot\sigma=0$. 
Following the same analogy, we shall call two bivectors $\mathfrak{r}$ and $\mathfrak{s}$ 
perpendicular, when 

$$\mathfrak{r}\cdot\mathfrak{s} = 0.$$

In considering the geometrical signification of this equation, we shall 
first suppose that the real and imaginary components of $\mathfrak{r}$ and $\mathfrak{s}$ lie in 
the same plane, and that both $\mathfrak{r}$ and $\mathfrak{s}$ have not real directions. It is 
then evidently possible to express them in the form 

$$m[\alpha+\iota\beta], \quad m'[\alpha'+\iota\beta'],$$

where $m$ and $m'$ are biscalar, $\alpha$ and $\beta$ are at right angles, and $\alpha'$ 
parallel with $\beta$. Then the equation $\mathfrak{r}\cdot\mathfrak{s}=0$ requires that 

$$\beta\cdot\beta' = 0, \quad\text{and}\quad \alpha\cdot\beta'+\beta\cdot\alpha' = 0.$$

This shows that the directional ellipses of the two bivectors are 
similar and the angular direction from the real to the imaginary 
component is the same in both, but the major axes of the ellipses are 
perpendicular. The case in which the directions of $\mathfrak{r}$ and $\mathfrak{s}$ are real, 
forms no exception to this rule. 

It will be observed that every circular bivector is perpendicular to 
itself, and to every parallel bivector. 

If two bivectors, which do not lie in the same plane, 

$$\mu+\iota\nu, \quad \mu'+\iota\nu',$$

are perpendicular, we may resolve $\mu$ and $\nu$ into components parallel 
and perpendicular to the plane of $\mu'$ and $\nu'$. The components perpendicular 
to the plane evidently contribute nothing to the value of 

$$[\mu+\iota\nu]\cdot[\mu'+\iota\nu'].$$

Therefore the components of $\mu$ and $\nu$ parallel to the plane of $\mu'$, $\nu'$, 
form a bivector which is perpendicular to $\mu'+\iota\nu'$. That is, if two 
bivectors are perpendicular, the directional ellipse of either, projected 
upon the plane of the other and rotated through a quadrant in that 
plane, will be similar and similarly situated to the directional ellipse 
of the second. 

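8. A bivector may be divided in one and only one way into parts 
parallel and perpendicular to another, provided that the second is not 
circular. If $\mathfrak{a}$ and $\mathfrak{b}$ are the bivectors, the parts of $\mathfrak{a}$ will be 

$$\frac{\mathfrak{b}\cdot\mathfrak{a}}{\mathfrak{b}\cdot\mathfrak{b}}\,\mathfrak{b} \quad\text{and}\quad \mathfrak{a}-\frac{\mathfrak{b}\cdot\mathfrak{a}}{\mathfrak{b}\cdot\mathfrak{b}}\,\mathfrak{b}.$$

If $\mathfrak{b}$ is circular, the resolution of $\mathfrak{a}$ is impossible, unless it is perpendicular 
to $\mathfrak{b}$. In this case the resolution is indeterminate. 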
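
The resolution of No. 8 can be sketched in a few lines. The representation (lists of three complex numbers), the function name `resolve`, the tolerance and the sample values are assumptions made for illustration only.

```python
def dot(u, v):
    """Direct product of two 3-component sequences (no complex conjugation)."""
    return sum(a * b for a, b in zip(u, v))

def resolve(a, b, tol=1e-12):
    """Split a into parts parallel and perpendicular to b (b must not be circular)."""
    bb = dot(b, b)
    if abs(bb) < tol:
        raise ValueError("b is circular (b.b = 0): no unique resolution")
    k = dot(b, a) / bb                       # biscalar coefficient of the parallel part
    parallel = [k * z for z in b]
    perpendicular = [x - y for x, y in zip(a, parallel)]
    return parallel, perpendicular

a = [2 + 1j, -1j, 3]                         # arbitrary sample bivectors
b = [1, 1 + 1j, 0.5j]
parallel, perpendicular = resolve(a, b)
print(abs(dot(perpendicular, b)) < 1e-12)    # second part is perpendicular to b
```

The guard on `b.b` corresponds to the exceptional case of a circular second bivector, for which the text says the resolution is impossible or indeterminate.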

9. Since $\mathfrak{a}\times\mathfrak{b}\cdot\mathfrak{a}=0$, and $\mathfrak{a}\times\mathfrak{b}\cdot\mathfrak{b}=0$, $\mathfrak{a}\times\mathfrak{b}$ is perpendicular to $\mathfrak{a}$ and $\mathfrak{b}$. 
We may regard the plane of the product as determined by the 
condition that the directional ellipses of the factors projected upon 
it become similar and similarly situated. The directional ellipse of 
the product is similar to these projections, but its orientation is 
different by 90°. It may easily be shown that $\mathfrak{a}\times\mathfrak{b}$ vanishes only 
with $\mathfrak{a}$ or $\mathfrak{b}$, or when $\mathfrak{a}$ and $\mathfrak{b}$ are parallel. 

10. The bivector equation 

$$(\mathfrak{a}\times\mathfrak{b}\cdot\mathfrak{c})\,\mathfrak{d}-(\mathfrak{b}\cdot\mathfrak{c}\times\mathfrak{d})\,\mathfrak{a}+(\mathfrak{c}\cdot\mathfrak{d}\times\mathfrak{a})\,\mathfrak{b}-(\mathfrak{d}\cdot\mathfrak{a}\times\mathfrak{b})\,\mathfrak{c} = 0$$

is identical, as may be verified by substituting expressions of the 
form $xi+yj+zk$ ($x$, $y$, $z$ being biscalars), for each of the bivectors. 

(Compare No. 37.) This equation shows that if the product $\mathfrak{a}\times\mathfrak{b}$ 
of any two bivectors vanishes, one of these will be equal to the other 
with a biscalar coefficient, that is, they will be parallel, according 
to the definition given above. If the product $\mathfrak{a}\cdot\mathfrak{b}\times\mathfrak{c}$ of any three 
bivectors vanishes, the equation shows that one of these may be 
expressed as a sum of the other two with biscalar coefficients. In 
this case, we may say (from the analogy of the scalar analysis) that 
the three bivectors are complanar. (This does not imply that they 
lie in any same real plane.) If $\mathfrak{a}\cdot\mathfrak{b}\times\mathfrak{c}$ is not equal to zero, the 
equation shows that any fourth bivector may be expressed as a sum 
of $\mathfrak{a}$, $\mathfrak{b}$, and $\mathfrak{c}$ with biscalar coefficients, and indicates how these 
coefficients may be determined. 
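
As an illustration of how the coefficients may be read off from the identity of No. 10, the sketch below solves it for the fourth bivector. The list-of-complex representation, the helper names `triple` and `coefficients`, and the sample values are our own assumptions.

```python
def dot(u, v):
    """Direct product of two 3-component sequences (no complex conjugation)."""
    return sum(a * b for a, b in zip(u, v))

def cross(u, v):
    """Skew product of two 3-component sequences."""
    return [u[1] * v[2] - u[2] * v[1],
            u[2] * v[0] - u[0] * v[2],
            u[0] * v[1] - u[1] * v[0]]

def triple(a, b, c):
    """The product a . b x c."""
    return dot(a, cross(b, c))

def coefficients(a, b, c, d, tol=1e-12):
    """Return biscalars (x, y, z) with d = x*a + y*b + z*c, taken from the identity."""
    abc = triple(a, b, c)
    if abs(abc) < tol:
        raise ValueError("a, b, c are complanar: a.bxc vanishes")
    return (triple(b, c, d) / abc,
            -triple(c, d, a) / abc,
            triple(d, a, b) / abc)

a = [1 + 1j, 0, 2]                       # arbitrary sample bivectors
b = [0, 1 - 1j, 1j]
c = [1j, 1, 1]
d = [2 - 1j, 3j, -1]
x, y, z = coefficients(a, b, c, d)
rebuilt = [x * p + y * q + z * r for p, q, r in zip(a, b, c)]
print(all(abs(u - v) < 1e-12 for u, v in zip(rebuilt, d)))
```

The signs of the three coefficients follow the alternating signs of the terms in the identity.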

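11. The equation 

$$(\mathfrak{r}\cdot\mathfrak{a})\,\mathfrak{b}\times\mathfrak{c}+(\mathfrak{r}\cdot\mathfrak{b})\,\mathfrak{c}\times\mathfrak{a}+(\mathfrak{r}\cdot\mathfrak{c})\,\mathfrak{a}\times\mathfrak{b} = (\mathfrak{a}\times\mathfrak{b}\cdot\mathfrak{c})\,\mathfrak{r}$$

is also identical, as may easily be verified. If we set 

$$\mathfrak{c} = \mathfrak{a}\times\mathfrak{b},$$

and suppose that 

$$\mathfrak{r}\cdot\mathfrak{a} = 0, \quad \mathfrak{r}\cdot\mathfrak{b} = 0,$$

the equation becomes 

$$(\mathfrak{r}\cdot\mathfrak{a}\times\mathfrak{b})\,\mathfrak{a}\times\mathfrak{b} = (\mathfrak{a}\times\mathfrak{b}\cdot\mathfrak{a}\times\mathfrak{b})\,\mathfrak{r}.$$

This shows that if a bivector $\mathfrak{r}$ is perpendicular to two bivectors 
$\mathfrak{a}$ and $\mathfrak{b}$, which are not parallel, $\mathfrak{r}$ will be parallel to $\mathfrak{a}\times\mathfrak{b}$. Therefore 
all bivectors which are perpendicular to two given bivectors are 
parallel to each other, unless the given two are parallel. 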


[Note by Editors. The notation $|\Phi|$ used on page 64, was later improved by the 

author by the introduction of his Double Multiplication, according to which the above 
expression is represented by $\tfrac{1}{6}\,\Phi\,{}^{\times}_{\times}\,\Phi:\Phi$. See this volume, pages 112, 160, 

and 181. For an extended treatment of Professor Gibbs's researches on Double Multiplication 
in their application to Vector Analysis see pp. 306-321, and 333 of "Vector 
Analysis," by E. B. Wilson, Chas. Scribner's Sons, New York, 1901.] 



IV. 


ON MULTIPLE ALGEBRA. 

Address before the Section of Mathematics and Astronomy of the American 
Association for the Advancement of Science, by the Vice-President. 

[Proceedings of the American Association for the Advancement of Science, 
vol. XXXV. pp. 37-66, 1886.] 

It has been said that “ the human mind has never invented a labor- 
saving machine equal to algebra.”* If this be true, it is but natural 
and proper that an age like our own, characterized by the multiplication 
of labor-saving machinery, should be distinguished by an 
unexampled development of this most refined and most beautiful of 
machines. That such has been the case, none will question. The 
improvement has been in every part. Even to enumerate the principal 
lines of advance would be a task for any one; for me an impossibility. 
But if we should ask, in what direction the advance has been made 
which is to characterize the development of algebra in our day, we 
may, I think, point to that broadening of its field and methods which 
gives us multiple algebra. 

Of the importance of this change in the conception of the office of 
algebra, it is hardly necessary to speak: that it is really characteristic 
of our time will be most evident if we go back some two or threescore 
years, to the time when the seeds were sown which are now 
yielding so abundant a harvest. The failure of Möbius, Hamilton, 
Grassmann, Saint-Venant to make an immediate impression upon 
the course of mathematical thought in any way commensurate with 
the importance of their discoveries is the most conspicuous evidence 
that the times were not ripe for the methods which they sought to 
introduce. A satisfactory theory of the imaginary quantities of 
ordinary algebra, which is essentially a simple case of multiple 
algebra, with difficulty obtained recognition in the first third of this 
century. We must observe that this double algebra, as it has been 
called, was not sought for or invented;—it forced itself, unbidden, 
upon the attention of mathematicians, and with its rules already 
formed. 


* The Nation, vol. xxxiii, p. 237. 
But the idea of double algebra, once received, although as it were 
unwillingly, must have suggested to many minds more or less 
distinctly the possibility of other multiple algebras, of higher orders, 
possessing interesting or useful properties. 

The application of double algebra to the geometry of the plane 
suggested not unnaturally to Hamilton the idea of a triple algebra 
which should be capable of a similar application to the geometry of 
three dimensions. He was unable to find a satisfactory triple algebra, 
but discovered at length a quadruple algebra, quaternions, which 
answered his purpose, thus satisfying, as he says in one of his letters, 
an intellectual want which had haunted him at least fifteen years. 
So confident was he of the value of this algebra, that the same hour 
he obtained permission to lay his discovery before the Royal Irish 
Academy, which he did on November 13, 1843.* This system of 
multiple algebra is far better known than any other, except the 
ordinary double algebra of imaginary quantities,—far too well known 
to require any especial notice at my hands. All that here requires 
our attention is the close historical connection between the imaginaries 
of ordinary algebra and Hamilton's system, a fact emphasized by 
Hamilton himself and most writers on quaternions. It was quite 
otherwise with Möbius and Grassmann. 

The point of departure of the Barycentrischer Calcul of Möbius, 
published in 1827 (a work of which Clebsch has said that it can 
never be admired enough†), is the use of equations in which the 
terms consist of letters representing points with numerical coefficients, 
to express barycentric relations between the points. Thus, that the 
point S is the centre of gravity of weights, a, b, c, d, placed at the 
points A, B, C, D, respectively, is expressed by the equation 

$$(a+b+c+d)\,S = aA+bB+cC+dD.$$

An equation of the more general form 

$$aA+bB+cC+\text{etc.} = pP+qQ+rR+\text{etc.}$$

signifies that the weights a, b, c, etc., at the points A, B, C, etc., have 
the same sum and the same centre of gravity as the weights p, q, r, 
etc., at the points P, Q, R, etc., or, in other words, that the former are 
barycentrically equivalent to the latter. Such equations, of which 
each represents four ordinary equations, may evidently be multiplied 
or divided by scalars,‡ may be added or subtracted, and may have 

* Phil. Mag. (3), vol. xxv, p. 490; North British Review, vol. xlv (1866), p. 57. 

† See his eulogy on Plücker, p. 14, Gött. Abhandl., vol. xvi. 

‡ I use this term in Hamilton's sense, to denote the ordinary positive and negative 
quantities of algebra. It may, however, be observed that in most cases in which I shall 
have occasion to use it, the proposition would hold without exclusion of imaginary 
quantities, and that this exclusion is generally for simplicity and not from necessity. 
their terms arranged and transposed, exactly like the ordinary 
representing points from equations of this kind is performed by the 
rules of ordinary algebra. This is evidently the beginning of a 
quadruple algebra, and is identical, as far as it goes, with Grassmann’s 
marvellous geometrical algebra. 
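
The remark that each barycentric equation represents four ordinary equations can be made concrete. In the sketch below, which rests on our own choice of representation, a term such as aA is stored as four numbers (the weight, and the weight times each coordinate of the point); the names `weighted`, `add` and `centre` and the sample points are hypothetical.

```python
from fractions import Fraction

def weighted(weight, point):
    """The term weight*point as four numbers: weight, then weight times each coordinate."""
    w = Fraction(weight)
    return (w,) + tuple(w * Fraction(c) for c in point)

def add(*terms):
    """Add barycentric terms component by component."""
    return tuple(sum(column) for column in zip(*terms))

def centre(term):
    """Total weight and centre of gravity of a barycentric sum (weight must be nonzero)."""
    w = term[0]
    return w, tuple(c / w for c in term[1:])

A, B, C, D = (0, 0, 0), (4, 0, 0), (0, 4, 0), (0, 0, 4)   # sample points
total = add(weighted(1, A), weighted(1, B), weighted(1, C), weighted(1, D))
weight, S = centre(total)
print(weighted(weight, S) == total)    # (a+b+c+d)S = aA + bB + cC + dD
```

Because the four components add and scale independently, such terms can be transposed and eliminated by the rules of ordinary algebra, as the text observes.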

In the same work we find also, for the first time so far as I am 
aware, the distinction of positive and negative consistently carried 
out in the designation of segments of lines, of triangles, and of tetrahedra, 
viz., that a change in place of two letters, in such expressions 
as AB, ABC, ABCD, is equivalent to prefixing the negative sign. It 
is impossible to overestimate the importance of this step, which gives 
to designations of this kind the generality and precision of algebra. 

Moreover, if A, B, C are three points in the same straight line, and 
D any point outside of that line, the author observes that we have 

$$AB+BC+CA = 0,$$

and also, with D prefixed, 

$$DAB+DBC+DCA = 0.$$

Again, if A, B, C, D are four points in the same plane, and E any 
point outside of that plane, we have 

$$ABC-BCD+CDA-DAB = 0,$$

and also, with E prefixed, 

$$EABC-EBCD+ECDA-EDAB = 0.$$
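
These sign conventions are easy to test on numbers. The sketch below assumes, for illustration only, that a triangle such as ABC is measured by twice its signed area in plane coordinates; the function name and all sample points are our own, and the three collinear points are called P, Q, R to keep them apart from the four coplanar ones.

```python
def twice_area(p, q, r):
    """Twice the signed area of triangle pqr (positive when counter-clockwise)."""
    return (q[0] - p[0]) * (r[1] - p[1]) - (q[1] - p[1]) * (r[0] - p[0])

A, B, C, D = (0, 0), (3, 1), (4, 5), (-2, 2)    # four arbitrary points of one plane

# Exchanging two letters is equivalent to prefixing the negative sign.
swap = twice_area(A, B, C) + twice_area(B, A, C)

# ABC - BCD + CDA - DAB for four points of the same plane.
planar = (twice_area(A, B, C) - twice_area(B, C, D)
          + twice_area(C, D, A) - twice_area(D, A, B))

P, Q, R = (0, 0), (2, 1), (6, 3)                # three points of one straight line
outside = (1, 5)                                # a point off that line
prefixed = (twice_area(outside, P, Q) + twice_area(outside, Q, R)
            + twice_area(outside, R, P))

print(swap, planar, prefixed)
```

Integer coordinates are used so that the three sums are exact.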

The similarity to multiplication in the derivation of these formulae 
cannot have escaped the author's notice. Yet he does not seem to 
have been able to generalize these processes. It was reserved for the 
genius of Grassmann to see that AB might be regarded as the 
product of A and B, DAB as the product of D and AB, and EABC 
as the product of E and ABC. That Möbius could not make this step 
was evidently due to the fact that he had not the conception of the 
addition of other multiple quantities than such as may be represented 
by masses situated at points. Even the addition of vectors (i.e., the 
fact that the composition of directed lines could be treated as an 
addition) seems to have been unknown to him at this time, although 
he subsequently discovered it, and used it in his Mechanik des 
Himmels, which was published in 1843. This addition of vectors, 
or geometrical addition, seems to have occurred independently to 
many persons. 

Seventeen years after the Barycentrischer Calcul, in 1844, the year 
in which Hamilton's first papers on quaternions appeared in print, 
Grassmann published his Lineale Ausdehnungslehre, in which he 
developed the idea and the properties of the external or combinatorial 
product, a conception which is perhaps to be regarded as the greatest 
monument of the author's genius. This volume was to have been 
followed by another, of the nature of which some intimation was 
given in the preface and in the work itself. We are especially told 
that the internal product,* which for vectors is identical except in 
sign with the scalar part of Hamilton's product (just as Grassmann's 
external product of two vectors is practically identical with the 
vector part of Hamilton's product), and the open product,† which in 
the language of to-day would be called a matrix, were to be treated 
in the second volume. But both the internal product of vectors and 
the open product are clearly defined, and their fundamental properties 
indicated, in this first volume. 

This remarkable work remained unnoticed for more than twenty 
years, a fact which was doubtless due in part to the very abstract and 
philosophical manner in which the subject was presented. In consequence 
of this neglect the author changed his plan, and instead of a 
supplementary volume published in 1862 a single volume entitled 
Ausdehnungslehre, in which were treated, in an entirely different 
style, the same topics as in the first volume, as well as those which he 
had reserved for the second. 

Deferring for the moment the discussion of these topics in order to 
follow the course of events, we find in the year following the first 
Ausdehnungslehre a remarkable memoir of Saint-Venant,‡ in which 
are clearly described the addition both of vectors and of oriented 
areas, the differentiation of these with respect to a scalar quantity, 
and a multiplication of two vectors and of a vector and an oriented 
area. These multiplications, called by the author geometrical, are 
entirely identical with Grassmann’s external multiplication of the 
same quantities. 

It is a striking fact in the history of the subject, that the short 
period of less than two years was marked by the appearance of 
well-developed and valuable systems of multiple algebra by British, 
German, and French authors, working apparently entirely independently 
of one another. No system of multiple algebra had 
appeared before, so far as I know, except such as were confined to 
additive processes with multiplication by scalars, or related to the 
ordinary double algebra of imaginary quantities. But the appearance 
of a single one of these systems would have been sufficient to mark 
an epoch, perhaps the most important epoch in the history of the 
subject. 

In 1853 and 1854, Cauchy published several memoirs on what 
he called clefs algébriques.§ These were units subject generally to 

* See the preface. † See § 172. 

‡ Comptes Rendus, vol. xxi, p. 620. § Comptes Rendus, vols. xxxvi, ff. 
combinatorial multiplication. His principal application was to the 
theory of elimination. In this application, as in the law of multiplication, 
he had been anticipated by Grassmann. 

We come next to Cayley's celebrated Memoir on the Theory of 
Matrices* in 1858, of which Sylvester has said that it seems to him 
to have ushered in the reign of Algebra the Second.† I quote this 
dictum of a master as showing his opinion of the importance of the 
subject and of the memoir. But the foundations of the theory of 
matrices, regarded as multiple quantities, seem to me to have been 
already laid in the Ausdehnungslehre of 1844. To Grassmann's 
treatment of this subject we shall recur later. 

After the Ausdehnungslehre of 1862, already mentioned, we come 
to Hankel's Vorlesungen über die complexen Zahlen, 1867. Under 
this title the author treats of the imaginary quantities of ordinary 
algebra, of what he calls alternirende Zahlen, and of quaternions. 
These alternate numbers, like Cauchy's clefs, are quantities subject 
to Grassmann's law of combinatorial multiplication. This treatise, 
published twenty-three years after the first Ausdehnungslehre, marks 
the first impression which we can discover of Grassmann's ideas upon 
the course of mathematical thought. The transcendent importance 
of these ideas was fully appreciated by the author, whose very able 
work seems to have had considerable influence in calling the attention 
of mathematicians to the subject. 

In 1870, Professor Benjamin Peirce published his Linear Associative 
Algebra, subsequently developed and enriched by his son, Professor 
C. S. Peirce. The fact that the edition was lithographed seems to 
indicate that even at this late date a work of this kind could only 
be regarded as addressed to a limited number of readers. But the 
increasing interest in such subjects is shown by the republication of 
this memoir in 1881,‡ as by that of the first Ausdehnungslehre in 
1878. 

The article on quaternions which has just appeared in the Encyclopædia 
Britannica mentions twelve treatises, including second editions 
and translations, besides the original treatises of Hamilton. That all 
the twelve are later than 1861 and all but two later than 1872 shows 
the rapid increase of interest in this subject in the last years. 

Finally, we arrive at the Lectures on the Principles of Universal 
Algebra by the distinguished foreigner whose sojourn among us has 
given such an impulse to mathematical study in this country. The 
publication of these lectures, commenced in 1884 in the American 
Journal of Mathematics, has not as yet been completed,—a want but 
imperfectly supplied by the author's somewhat desultory publication 

* Phil. Trans., vol. cxlviii. † Amer. Journ. Math., vol. vi, p. 271. 

‡ Amer. Journ. Math., vol. iv. 
of many remarkable papers on the same subject (which might be 
more definitely expressed as the algebra of matrices) in various 
foreign journals. 

It is not an accident that this century has seen the rise of multiple 
algebra. The course of the development of ideas in algebra and in 
geometry, although in the main independent of any aid from this 
source, has nevertheless to a very large extent been of a character 
which can only find its natural expression in multiple algebra. 

Our Modern Higher Algebra is especially occupied with the theory 
of linear transformations. Now what are the first notions which we 
meet in this theory? We have a set of n variables, say x, y, z, and 
another set, say x', y', z', which are homogeneous linear functions of 
the first, and therefore expressible in terms of them by means of a 
block of coefficients. Here the quantities occur by sets, and invite 
the notations of multiple algebra. It was in fact shown by Grassmann 
in his first Ausdehnungslehre and by Cauchy nine years later, 
that the notations of multiple algebra afford a natural key to the 
subject of elimination. 

Now I do not merely mean that we may save a little time or space 
by writing perhaps $\rho$ for x, y and z; $\rho'$ for x', y' and z'; and $\Phi$ for a 
block of quantities. But I mean that the subject as usually treated 
under the title of determinants has a stunted and misdirected development 
on account of the limitations of single algebra. This will appear 
from a very simple illustration. After a little preliminary matter 
the student comes generally to a chapter entitled "Multiplication of 
Determinants," in which he is taught that the product of the determinants 
of two matrices may be found by performing a somewhat 
lengthy operation on the two matrices, by which he obtains a third 
matrix, and then taking the determinant of this. But what significance, 
what value has this theorem? For aught that appears in the 
majority of treatises which I have seen, we have only a complicated 
and lengthy way of performing a simple operation. The real facts 
of the case may be stated as follows: 

Suppose the set of n quantities $\rho'$ to be derived from the set $\rho$ by 
the matrix $\Phi$, which we may express by 

$$\rho' = \Phi\cdot\rho;$$

and suppose the set $\rho''$ to be derived from the set $\rho'$ by the matrix $\Psi$, i.e., 

$$\rho'' = \Psi\cdot\rho',$$

and 

$$\rho'' = \Psi\cdot\Phi\cdot\rho;$$

it is evident that $\rho''$ can be derived from $\rho$ by the operation of a 
single matrix, say $\Theta$, i.e., 

$$\rho'' = \Theta\cdot\rho,$$

so that 

$$\Theta = \Psi\cdot\Phi.$$

In the language of multiple algebra $\Theta$ is called the product of 
$\Psi$ and $\Phi$. It is of course interesting to see how it is derived from 
the latter, and it is little more than a schoolboy's exercise to determine 
this. Now this matrix $\Theta$ has the property that its determinant 
is equal to the product of the determinants of $\Psi$ and $\Phi$. And this 
property is all that is generally stated in the books, and the fundamental 
property, which is all that gives the subject its interest, that $\Theta$ 
is itself the product of $\Psi$ and $\Phi$ in the language of multiple algebra, 
i.e., that operating by $\Theta$ is equivalent to operating successively by 
$\Phi$ and $\Psi$, is generally omitted. The chapter on this subject, in most 
treatises which I have seen, reads very like the play of Hamlet with 
Hamlet’s part left out. 
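
The two properties contrasted here, that the product matrix carries out both operations at once and that its determinant is the product of the determinants, can be seen side by side in a small sketch. The 3 by 3 integer matrices, the set of quantities and the helper names `apply`, `product` and `det3` are illustrative assumptions.

```python
def apply(matrix, values):
    """Operate on a set of quantities by a matrix."""
    return [sum(m * v for m, v in zip(row, values)) for row in matrix]

def product(left, right):
    """Matrix equivalent to operating by `right` and then by `left`."""
    columns = list(zip(*right))
    return [[sum(l * r for l, r in zip(row, col)) for col in columns] for row in left]

def det3(m):
    """Determinant of a 3 by 3 matrix by expansion along the first row."""
    return (m[0][0] * (m[1][1] * m[2][2] - m[1][2] * m[2][1])
            - m[0][1] * (m[1][0] * m[2][2] - m[1][2] * m[2][0])
            + m[0][2] * (m[1][0] * m[2][1] - m[1][1] * m[2][0]))

phi = [[1, 2, 0], [0, 1, 3], [4, 0, 1]]     # sample matrices
psi = [[2, 0, 1], [1, 1, 0], [0, 3, 1]]
rho = [1, -2, 3]                            # sample set of quantities

theta = product(psi, phi)
two_steps = apply(psi, apply(phi, rho))     # operate by phi, then by psi
one_step = apply(theta, rho)                # operate once by theta
print(one_step == two_steps)
print(det3(theta) == det3(psi) * det3(phi))
```

The first comparison is the property that gives the subject its interest; the second is the one usually stated alone.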

And what is the cause of this omission ? Certainly not ignorance 
of the property in question. The fact that it is occasionally 
given would be a sufficient bar to this answer. It is because 
the author fails to see that his real subject is matrices and not 
determinants. Of course, in a certain sense, the author has a right 
to choose his subject. But this does not mean that the choice 
is unimportant, or that it should be determined by chance or by 
caprice. The problem well put is half solved, as we all know. 
If one chooses the subject ill, it will develop itself in a cramped 
manner. 

But the case is really much worse than I have stated it. Not only 
is the true significance of the formation of $\Theta$ from $\Psi$ and $\Phi$ not given, 
but the student is often not taught to form the matrix which is the 
product of $\Psi$ and $\Phi$, but one which is the product of one of these 
matrices and the conjugate of the other. Thus the proposition which 
is proved loses all its simplicity and significance, and must be recast 
before the instructor can explain its true bearings to the student. 
This fault has been denounced by Sylvester, and if anyone thinks 
I make too much of the standpoint from which the subject is viewed, 
I will refer him to the opening paragraphs of the "Lectures on 
Universal Algebra" in the sixth volume of the American Journal of 
Mathematics, where, with a wealth of illustration and an energy 
of diction which I cannot emulate, the most eloquent of mathematicians 
expresses his sense of the importance of the substitution 
of the idea of the matrix for that of the determinant. If then so 
important, why was the idea of the matrix let slip? Of course the 
writers on this subject had it to commence with. One cannot even 
define a determinant without the idea of a matrix. The simple fact is 
that in general the writers on this subject have especially developed 
those ideas which are naturally expressed in simple algebra, and have 
postponed or slurred over or omitted altogether those ideas which 
find their natural expression in multiple algebra. But in this subject 
the latter happen to be the fundamental ideas, and those which ought 
to direct the whole course of thought. 
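
The fault complained of above, forming the product of one matrix with the conjugate of the other, can be illustrated on a small case. The sketch assumes that "conjugate" means the transposed matrix and that the textbook rule in question combines rows with rows; the 2 by 2 matrices and all function names are our own.

```python
def transpose(m):
    """Conjugate (transposed) matrix: rows and columns exchanged."""
    return [list(col) for col in zip(*m)]

def product(left, right):
    """Rows of `left` into columns of `right`: the matrix of successive operation."""
    columns = list(zip(*right))
    return [[sum(l * r for l, r in zip(row, col)) for col in columns] for row in left]

def row_by_row(left, right):
    """Rows of `left` into rows of `right`, as in some determinant rules."""
    return [[sum(l * r for l, r in zip(lrow, rrow)) for rrow in right] for lrow in left]

def det2(m):
    return m[0][0] * m[1][1] - m[0][1] * m[1][0]

psi = [[1, 2], [3, 4]]                      # sample matrices
phi = [[0, 1], [5, 2]]

true_product = product(psi, phi)
textbook = row_by_row(psi, phi)

print(det2(textbook) == det2(psi) * det2(phi))      # determinant theorem still holds
print(textbook == product(psi, transpose(phi)))     # but it is psi times the conjugate of phi
print(textbook == true_product)                     # and not the product of psi and phi
```

The determinant is unaffected because a matrix and its conjugate have the same determinant, which is why the row-by-row rule proves the theorem while hiding its meaning.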

I have taken a very simple illustration, perhaps the very first 
theorem which meets the student after those immediately connected 
with the introductory definitions, both because the simplest illustration 
is really the best, and because I am here most at home. But the 
principles of multiple algebra seem to me to shed a flood of light into 
every corner of the subjects usually treated under the title of determinants, 
the subject gaining as much in breadth from the new notions 
as in simplicity from the new notations; and in the more intricate 
subjects of invariants, covariants, etc., I believe that the principles of 
multiple algebra are ready to perform an equal service. Certainly 
they make many things seem very simple to me which I should 
otherwise find difficult of comprehension. 

Let us turn to geometry. 

If we were asked to characterize in a single word our modern 
geometry, we would perhaps say that it is a geometry of position. 
Now position is essentially a multiple quantity, or, if you prefer, is 
naturally represented in algebra by a multiple quantity. And the 
growth in this century of the so-called synthetic as opposed to 
analytical geometry seems due to the fact that by the ordinary 
analysis geometers could not easily express, except in a cumbersome 
and unnatural manner, the sort of relations in which they were particularly 
interested. With the introduction of the notations of multiple 
algebra, this difficulty falls away, and with it the opposition between 
synthetic and analytical geometry. 

It is, however, interesting and very instructive to observe how the 
ingenuity of mathematicians has often triumphed over the limitations 
of ordinary algebra. A conspicuous example and one of the simplest 
is seen in the Mécanique Analytique, where the author, by the use 
of what are sometimes called indeterminate equations, is able to write 
in one equation the equivalent of an indefinite number. Thus the 
equation 

$$X\,dx+Y\,dy+Z\,dz = 0,$$

by the indeterminateness of the values of $dx$, $dy$, $dz$, is made equivalent 
to the three equations 

$$X = 0, \quad Y = 0, \quad Z = 0.$$

It is instructive to compare this with 

$$Xi+Yj+Zk = 0,$$

which is the form that Hamilton or Grassmann would have used. 
The use of this analytical artifice, if such it can be called, runs all 
through the work and is fairly characteristic of it. 

Again, the introduction of the potential in the theory of gravity, or 
electricity, or magnetism, gives us a scalar quantity instead of a vector 
as the subject of study; and in mechanics generally the use of the 
force-function substitutes a simple quantity for a complex. This 
method is in reality not different from that just mentioned, since 
Lagrange’s indeterminate equation expresses, at least in its origin, 
the variation of the force-function. It is indeed the real beauty of 
Lagrange’s method that it is not so much an analytical artifice, as the 
natural development of the subject. 

In modern analytical geometry we find methods in use which are 
exceedingly ingenious, and give forms curiously like those of multiple 
algebra, but which, at least if logically carried out very far, are excessively 
artificial, and that for the expression of the simplest things. 
The simplest conceptions of the geometry of three dimensions are 
points and planes, and the simplest relation between these is that a 
point lies in a plane. Let us see how these notions have been handled 
by means of ordinary algebra, and by multiple algebra. It will illustrate 
the characteristic difference of the methods, perhaps as well as 
the reading of an elaborate treatise. 

In multiple algebra a point is designated by a single letter, just as 
it is in what is called synthetic geometry, and as it generally is by the 
ordinary analyst when he is not writing equations. But in his 
equations, instead of a single letter the analyst introduces several 
letters (coordinates) to represent the point. 

A plane may be represented in multiple algebra as in synthetic 
geometry by a single letter; in the ordinary algebra it is sometimes 
represented by three coordinates, for which it is most convenient to 
take the reciprocals of the segments cut off by the plane on three axes. 
But the modern analyst has a more ingenious method of representing 
the plane. He observes that the equation of the plane may be written 

$$\xi x+\eta y+\zeta z = 1, \qquad (1)$$

where $\xi$, $\eta$, $\zeta$ are the reciprocals of the segments, and x, y, z are the 
coordinates of any point in the plane. Now if we set 

$$p = \xi x+\eta y+\zeta z, \qquad (2)$$

this letter will represent an expression which represents the plane. In 
fact, we may say that p implicitly contains $\xi$, $\eta$ and $\zeta$, which are the 
coordinates of the plane. We may therefore speak of the plane p, and 
for many purposes can introduce the letter p into our equations instead 
of $\xi$, $\eta$, $\zeta$. For example, the equation 

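$$p''' = \frac{p'+p''}{2} \qquad (3)$$

is equivalent to the three equations 

$$\xi''' = \frac{\xi'+\xi''}{2}, \quad \eta''' = \frac{\eta'+\eta''}{2}, \quad \zeta''' = \frac{\zeta'+\zeta''}{2}. \qquad (4)$$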
It is to be noticed that on account of the indeterminateness of the 
x, y, and z, this method, regarded as an analytical artifice, is identical 
with that of Lagrange, also that in multiple algebra we should have 
an equation of precisely the same form as (3) to express the same 
relation between the planes, but that the equation would be explained 
to the student in a totally different manner. This we shall see more 
particularly hereafter. 

It is curious that we have thus a simpler notation for a plane than 
for a point. This however may be reversed. If we commence with 
the notion of the coordinates of a plane, \(\xi\), \(\eta\), \(\zeta\), the equation of a 
point (i.e., the equation between \(\xi\), \(\eta\), \(\zeta\) which will hold for every 
plane passing through the point) will be 

\[ \xi x + \eta y + \zeta z = 1, \tag{5} \]

where x, y, z are the coordinates of the point. Now if we set 

\[ q = x\xi + y\eta + z\zeta, \tag{6} \]

we may regard the single letter q as representing the point, and use 
it, in many cases, instead of the coordinates x, y, z, which indeed it 
implicitly contains. Thus we may write 

\[ q''' = \frac{q' + q''}{2} \tag{7} \]

for the three equations 

\[ x''' = \frac{x' + x''}{2}, \quad y''' = \frac{y' + y''}{2}, \quad z''' = \frac{z' + z''}{2}. \tag{8} \]


Here, by an analytical artifice, we come to equations identical in 
form and meaning with those used by Hamilton, Grassmann, and even 
by Möbius in 1827. But the explanations of the formulae would 
differ widely. The methods of the founders of multiple algebra are 
characterized by a bold simplicity, that of the modern geometry by a 
somewhat bewildering ingenuity. That p and q represent the same 
expression (in one case x, y, z, and in the other \(\xi\), \(\eta\), \(\zeta\) being 
indeterminate) is a circumstance which may easily become perplexing. I am 
not quite certain that it would be convenient to use both of these 
abridged notations at the same time. In fact, if the geometer using 
these methods were asked to express by an equation in p and q that 
the point q lies in the plane p, he might find himself somewhat 
entangled in the meshes of his own ingenuity, and need some new 
artifice to extricate himself. I do not mean that his genius might 
not possibly be equal to the occasion, but I do mean very seriously 
that it is a vicious method which requires any ingenuity or any 
artifice to express so simple a relation. 

If we use the methods of multiple algebra which are most 
comparable to those just described, a point is naturally represented by a 
vector (\(\rho\)) drawn to it from the origin, a plane by a vector (\(\sigma\)) drawn 
from the origin perpendicularly toward the plane and in length equal 
to the reciprocal of the distance of the plane from the origin. The 
equation 

\[ \sigma''' = \frac{\sigma' + \sigma''}{2} \tag{9} \]

will have precisely the same meaning as equation (3), and 

\[ \rho''' = \frac{\rho' + \rho''}{2} \tag{10} \]

will have precisely the same meaning as equation (7), viz., that the 
point \(\rho'''\) is in the middle between \(\rho'\) and \(\rho''\). That the point \(\rho\) lies in 
the plane \(\sigma\) is expressed by equating to unity the product of \(\rho\) and \(\sigma\) 
called by Grassmann internal, or by Hamilton called the scalar part 
of the product taken negatively. By whatever name called, the 
quantity in question is the product of the lengths of the vectors and 
the cosine of the included angle. It is of course immaterial what 
particular sign we use to express this product, as whether we write 

\[ \rho \cdot \sigma = 1, \quad \text{or} \quad S\rho\sigma = -1. \tag{11} \]

I should myself prefer the simplest possible sign for so simple a 
relation. It may be observed that \(\rho\) and \(\sigma\) may be expressed as the 
geometrical sum of their components parallel to a set of perpendicular 
axes, viz., 

\[ \rho = xi + yj + zk, \quad \sigma = \xi i + \eta j + \zeta k. \tag{12} \]

By substitution of these values, equation (11) becomes by the laws of 
this kind of multiplication 

\[ x\xi + y\eta + z\zeta = 1. \tag{13} \]
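
The condition of equation (11), that a point lies in a plane when the internal product of the two vectors is unity, can be tried numerically. The following is only a sketch: the function names and the plane with intercepts 2, 3, 6 are assumptions chosen for illustration and do not come from the text.

```python
from fractions import Fraction


def internal_product(u, v):
    """Sum of the products of corresponding rectangular components."""
    return sum(a * b for a, b in zip(u, v))


def plane_vector(intercepts):
    """Components of the plane vector: reciprocals of the segments cut off on the axes."""
    return tuple(Fraction(1, a) for a in intercepts)


# Hypothetical plane cutting the three axes at 2, 3 and 6.
sigma = plane_vector((2, 3, 6))
rho_on = (1, 1, 1)   # 1/2 + 1/3 + 1/6 = 1, so this point lies in the plane
rho_off = (1, 1, 2)  # 1/2 + 1/3 + 2/6 = 7/6, so this point does not

print(internal_product(rho_on, sigma) == 1)   # True
print(internal_product(rho_off, sigma) == 1)  # False
```

Exact fractions are used so that the comparison with unity is not disturbed by rounding.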

My object in going over these elementary matters is to call attention 
to the very roundabout way in which the ordinary analysis makes 
out to represent a point or a plane by a single letter, as distinguished 
from the directness and simplicity of the notations of multiple algebra, 
and also to the fact that the representations of points and planes by 
single letters in the ordinary analysis are not, when obtained, as 
amenable to analytical treatment as are the notations of multiple 
algebra. 

I have compared that form of the ordinary analysis which relates 
to Cartesian axes with a vector analysis. But the case is essentially 
the same if we compare the form of ordinary analysis which relates 
to a fundamental tetrahedron with Grassmann’s geometrical analysis, 
founded on the point as the elementary quantity. 

In the method of ordinary analysis a point is represented by four 
coordinates, of which each represents the distance of the point from 
a plane of the tetrahedron divided by the distance of the opposite 
vertex from the same plane. The equation of a plane may be put 
in the form 

\[ \xi x + \eta y + \zeta z + \omega w = 0, \tag{14} \]

where \(\xi\), \(\eta\), \(\zeta\), \(\omega\) are the distances of the plane from the four points, 
and x, y, z, w are the coordinates of any point in the plane. Here 
we may set 

\[ p = \xi x + \eta y + \zeta z + \omega w, \tag{15} \]

and say that p represents the plane. To some extent we can 
introduce this letter into equations instead of \(\xi\), \(\eta\), \(\zeta\), \(\omega\). Thus the equation 

\[ l p' + m p'' + n p''' = 0 \tag{16} \]

(which denotes that the planes p', p'', p''' meet in a common line, 
making angles of which the sines are proportional to l, m, and n) is 
equivalent to the four equations 

\[ l\xi' + m\xi'' + n\xi''' = 0, \quad l\eta' + m\eta'' + n\eta''' = 0, \quad \text{etc.} \tag{17} \]

Again, we may regard \(\xi\), \(\eta\), \(\zeta\), \(\omega\) as the coordinates of a plane. The 
equation of a point will then be 

\[ x\xi + y\eta + z\zeta + w\omega = 0. \tag{18} \]

If we set 

\[ q = x\xi + y\eta + z\zeta + w\omega, \tag{19} \]

we may say that q represents the point. The equation 

\[ q''' = \frac{q' + q''}{2}, \tag{20} \]

which indicates that the point q''' bisects the line between q' and q'', 
is equivalent to the four equations 

\[ x''' = \frac{x' + x''}{2}, \quad \text{etc.} \tag{21} \]


To express that the point q lies in the plane p does not seem easy, 
without going back to the use of coordinates. 

The form of multiple algebra which is to be compared to this is the 
geometrical algebra of Möbius and Grassmann, in which points without 
reference to any origin are represented by single letters, say by Italic 
capitals, and planes may also be represented by single letters, say by 
Greek capitals. An equation like 

\[ Q''' = \frac{Q' + Q''}{2}, \tag{22} \]

has exactly the same meaning as equation (20) of ordinary algebra. So 

\[ l\Pi' + m\Pi'' + n\Pi''' = 0 \tag{23} \]

has precisely the same meaning as equation (16) of ordinary algebra. 
That the point Q lies in the plane \(\Pi\) is expressed by equating to zero 
the product of Q and \(\Pi\) which is called by Grassmann external and 
which might be defined as the distance of the point from the plane. 
We may write this 

\[ Q \times \Pi = 0. \tag{24} \]

To show that so simple an expression is really amenable to analytical 
treatment, I observe that Q may be expressed in terms of any four 
points (not in the same plane) on the barycentric principle explained 
above, viz., 

\[ Q = xA + yB + zC + wD, \tag{25} \]

and \(\Pi\) may be expressed in terms of combinatorial products of 
A, B, C, and D, viz., 

\[ \Pi = \xi\, B \times C \times D + \eta\, C \times A \times D + \zeta\, D \times A \times B + \omega\, A \times C \times B, \tag{26} \]

and by these substitutions, by the laws of the combinatorial product 
to be mentioned hereafter, equation (24) is transformed into 

\[ w\omega + x\xi + y\eta + z\zeta = 0, \tag{27} \]

which is identical with the formula of ordinary analysis.* 

I have gone at length into this very simple point in order to 
illustrate the fact, which I think is a general one, that the modern 
geometry is not only tending to results which are appropriately 
expressed in multiple algebra, but that it is actually striving to clothe 
itself in forms which are remarkably similar to the notations of 
multiple algebra, only less simple and general and far less amenable 
to analytical treatment, and therefore, that a certain logical necessity 
calls for throwing off the yoke under which analytical geometry has 
so long labored. And lest this should seem to be the utterance of an 
uninformed enthusiasm, or the echoing of the possibly exaggerated 
claims of the devotees of a particular branch of mathematical study, 
I will quote a sentence from Clebsch and one from Clifford, relating 
to the past and to the future of multiple algebra. The former in 
his eulogy on Plücker,† in 1871, speaking of recent advances in 
geometry, says that “in a certain sense the coordinates of a straight 
line, and in general a great part of the fundamental conceptions of 
the newer algebra, are contained in the Ausdehnungslehre of 1844,” 
and Clifford‡ in the last year of his life, speaking of the 
Ausdehnungslehre, with which he had but recently become acquainted, 
expresses “his profound admiration of that extraordinary work, and 
his conviction that its principles will exercise a vast influence upon 
the future of mathematical science.” 

* The letters \(\xi\), \(\eta\), \(\zeta\), \(\omega\) here denote the distances of the plane \(\Pi\) from the points 
A, B, C, D, divided by six times the volume of the tetrahedron, A, B, C, D. The 
letters x, y, z, w denote the tetrahedral coordinates as above. 

† Abhandl., vol. xvi, p. 28. ‡ Amer. Journ. Math., vol. i, p. 350. 

Another subject in which we find a tendency toward the forms 
and methods of multiple algebra, is the calculus of operations. Our 
ordinary analysis introduces operators, and the successive operations 
A and B may be equivalent to the operation C. To express this in 
an equation we may write 

\[ BA(x) = C(x), \]

where x is any quantity or function. We may also have occasion to 
write 

\[ A(x) + B(x) = D(x), \quad \text{or} \quad (A + B)(x) = D(x). \]

But it is almost impossible to resist the tendency to express these 
relations in the form 

\[ BA = C, \]

\[ A + B = D, \]

in which the operators appear in a sense as quantities, i.e., as subjects 
of functional operation. Now since these operators are often of such 
nature that they cannot be perfectly specified by a single numerical 
quantity, when we treat them as quantities they must be regarded as 
multiple quantities. In this way certain formulæ which essentially 
belong to multiple algebra get a precarious footing where they are 
only allowed because they are regarded as abridged notations for 
equations in ordinary algebra. Yet the logical development of such 
notations would lead a good way in multiple algebra, and doubtless 
many investigators have entered the field from this side. 

One might also notice, to show how the ordinary algebra is 
becoming saturated with the notions and notations which seem destined 
to turn it into a multiple algebra, the notation so common in the 
higher algebra 

\[ (a, b, c)(x, y, z) \]

for \(ax + by + cz\). 

This is evidently the same as Grassmann’s internal product of the 
multiple quantities (a, b, c) and (x, y, z), or, in the language of 
quaternions, the scalar part, taken negatively, of the product of the 
vectors of which a, b, c and x, y, z are the components. A similar 
correspondence with Grassmann’s methods might, I think, be shown 
in such notations as, for example, 

\[ (a, b, c, d)(x, y)^3. \]

The free admission of such notations is doubtless due to the fact that 
they are regarded simply as abridged notations. 

The author of the celebrated “Memoir on the Theory of Matrices” 
goes much farther than this in his use of the forms of multiple 
algebra. Thus he writes explicitly one equation to stand for several, 
without the use of any of the analytical artifices which have been 
mentioned. This work has indeed, as we have seen, been characterized 
as marking the commencement of multiple algebra,—a view to which 
we can only take exception as not doing justice to earlier writers. 

But the significance of this memoir with regard to the point which 
I am now considering is that it shows that the chasm so marked 
in the second quarter of this century is destined to be closed up. 
Notions and notations for which a Cayley is sponsor will not be 
excluded from good society among mathematicians. And if we admit 
as suitable the notations used in this memoir (where it is noticeable 
that the author rather avoids multiple algebra, and only uses it very 
sparingly), we shall logically be brought to use a great deal more. 
For example, if it is a good thing to write in our equations a single 
letter to represent a matrix of \(n^2\) numerical quantities, why not use 
a single letter to represent the n quantities operated upon, as 
Grassmann and Hamilton have done? Logical consistency seems to 
demand it. And if we may use the sign X to denote an operation by 
which two sets of quantities are combined to form a third set, as is 
the case in this memoir, why not use other signs to denote other 
functional operations of which the result is a multiple quantity? 
If it be conceded that this is the proper method to follow where 
simplicity of conception, or brevity of expression, or ease of 
transformation is served thereby, our algebra will become in large part a 
multiple algebra. 

We have considered the subject a good while from the outside; we 
have glanced at the principal events in the history of multiple algebra; 
we have seen how the course of modern thought seems to demand its 
aid, how it is actually leaning toward it, and beginning to adopt its 
methods. It may be worth while to direct our attention more 
critically to multiple algebra itself, and inquire into its essential 
character and its most important principles. 

I do not know that anything useful or interesting, which relates 
to multiple quantity, and can be symbolically expressed, falls outside 
of the domain of multiple algebra. But if it is asked, what notions 
are to be regarded as fundamental, we must answer, here as 
elsewhere, those which are most simple and fruitful. Unquestionably, no 
relations are more so than those which are known by the names 
of addition and multiplication. 

Perhaps I should here notice the essentially different manner in 
which the multiplication of multiple quantities has been viewed 
by different writers. Some, as Hamilton, or De Morgan, or Peirce, 
speak of the product of two multiple quantities, as if only one product 
could exist, at least in the same algebra. Others, as Grassmann, speak 
of various kinds of products for the same multiple quantities. Thus 
Hamilton seems for many years to have agitated the question, what 
he should regard as the product of each pair of a set of triplets, or 
in the geometrical application of the subject, what he should regard 
as the product of each pair of a system of perpendicular directed 
lines.* Grassmann asks, What products, i.e., what distributive 
functions of the multiple quantities, are most important? 

* Phil. Mag. (3), vol. xxv, p. 490; North British Review, vol. xlv (1866), p. 57. 

It may be that in some cases the fact that only one kind of product 
is known in ordinary algebra has led those to whom the problem 
presented itself in the form of finding a new algebra to adopt this 
characteristic derived from the old. Perhaps the reason lies deeper 
in a distinction like that in arithmetic between concrete and abstract 
numbers or quantities. The multiple quantities corresponding to 
concrete quantities such as ten apples or three miles are evidently 
such combinations as ten apples + seven oranges, three miles 
northward + five miles eastward, or six miles in a direction fifty degrees 
east of north. Such are the fundamental multiple quantities from 
Grassmann’s point of view. But if we ask what it is in multiple 
algebra which corresponds to an abstract number like twelve, which 
is essentially an operator, which changes one mile into twelve miles, 
and $1,000 into $12,000, the most general answer would evidently 
be: an operator which will work such changes as, for example, that 
of ten apples + seven oranges into fifty apples + 100 oranges, or that 
of one vector into another. 

Now an operator has, of course, one characteristic relation, viz., its 
relation to the operand. This needs no especial definition, since it 
is contained in the definition of the operator. If the operation 
is distributive, it may not inappropriately be called multiplication, 
and the result is par excellence the product of the operator and 
operand. The sum of operators, qua operators, is an operator which 
gives for the product the sum of the products given by the operators 
to be added. The product of two operators is an operator which is 
equivalent to the successive operations of the factors. This 
multiplication is necessarily associative, and its definition is not really 
different from that of the operators themselves. And here I may 
observe that Professor C. S. Peirce has shown that his father's 
associative algebras may be regarded as operational and matricular.* 
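
The definitions of the sum and product of distributive operators given above may be sketched with small matrices acting on pairs of numbers. Everything named here (`apply`, `op_sum`, `op_product`, and the particular operators) is an assumption made for illustration; the first operator is merely one that works the change of ten apples + seven oranges into fifty apples + 100 oranges.

```python
from fractions import Fraction


def apply(op, x):
    """Product of a distributive operator (rows of coefficients) and an operand."""
    return tuple(sum(c * xi for c, xi in zip(row, x)) for row in op)


def op_sum(a, b):
    """Operator giving the sum of the products given by a and b."""
    return tuple(tuple(p + q for p, q in zip(ra, rb)) for ra, rb in zip(a, b))


def op_product(b, a):
    """Operator equivalent to applying a first and then b."""
    cols = list(zip(*a))
    return tuple(tuple(sum(p * q for p, q in zip(row, col)) for col in cols) for row in b)


fruit = (10, 7)  # ten apples + seven oranges
T = ((5, 0), (0, Fraction(100, 7)))
print(apply(T, fruit) == (50, 100))  # True

# Three arbitrary operators and an arbitrary operand.
A = ((1, 2), (0, 1))
B = ((0, 1), (3, 1))
C = ((2, 0), (1, 1))
x = (4, -1)

# The product BA acts as A followed by B, and the multiplication is associative.
print(apply(op_product(B, A), x) == apply(B, apply(A, x)))
print(op_product(C, op_product(B, A)) == op_product(op_product(C, B), A))
```

The associativity checked in the last line follows from the definition alone, as the text remarks; the matrices only make it visible.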

* Amer. Journ. Math., vol. iv, p. 221. 

Now the calculus of distributive operators is a subject of great 
extent and importance, but Grassmann’s view is the more 
comprehensive, since it embraces the other with something besides. For 
every quantitative operator may be regarded as a quantity, i.e., as 
the subject of mathematical operation, but every quantity cannot 
be regarded as an operator; precisely as in grammar every verb may 
be taken as substantive, as in the infinitive, while every substantive 
does not give us a verb. 

Grassmann’s view seems also the most practical and convenient. 
For we often use many functions of the same pair of multiple 
quantities, which are distributive with respect to both, and we need 
some simple designation to indicate a property of such fundamental 
importance in the algebra of such functions, and no advantage 
appears in singling out a particular function to be alone called 
the product. Even in quaternions, where Hamilton speaks of 
only one product of two vectors (regarding it as a special case 
of the product of quaternions, i.e., of operators), he nevertheless 
comes to use the scalar part of this product and the vector part 
separately. Now the distributive law is satisfied by each of these, 
which therefore may conveniently be called products. In this 
sense we have three kinds of products of vectors in Hamilton’s 
analysis. 

Let us then adopt the more general view of multiplication, and call 
any function of two or more multiple quantities, which is distributive 
with respect to all, a product, with only this limitation, that when 
one of the factors is simply an ordinary algebraic quantity, its effect 
is to be taken in the ordinary sense. 

It is to be observed that this definition of multiplication implies 
that we have an addition both of the kind of quantity to which the 
product belongs, and of the kinds to which the factors belong. Of 
course, these must be subject to the general formal laws of addition. 
I do not know that it is necessary for the purposes of a general 
discussion to stop to define these operations more particularly, either 
on their own account or to complete the definition of multiplication. 
Algebra, as a formal science, may rest on a purely formal foundation. 
To take our illustration again from mechanics, we may say that 
if a man is inventing a particular machine,—a sewing machine, a 
reaper,—nothing is more important than that he should have a precise 
idea of the operation which his machine is to perform, yet when he is 
treating the general principles of mechanics he may discuss the lever, 
or the form of the teeth of wheels which will transmit uniform 
motion, without inquiring the purpose to which the apparatus is to 
be applied; and in like manner that if we were forming a particular 
algebra,—a geometrical algebra, a mechanical algebra, an algebra 
for the theory of elimination and substitution, an algebra for the 
study of quantics,—we should commence by asking. What are the 
multiple quantities, or sets of quantities, which we have to consider? 
What are the additive relations between them? What are the 
multiplicative relations between them? etc., forming a perfectly defined 
and complete idea of these relations as we go along; but in the 
development of a general algebra no such definiteness of conception 
is requisite. Given only the purely formal law of the distributive 
character of multiplication,—this is sufficient for the foundation of a 
science. Nor will such a science be merely a pastime for an ingenious 
mind. It will serve a thousand purposes in the formation of 
particular algebras. Perhaps we shall find that in the most important 
cases the particular algebra is little more than an application or 
interpretation of the general. 

Grassmann observes that any kind of multiplication of n-fold 
quantities is characterized by the relations which hold between the 
products of n independent units. In certain kinds of multiplication 
these characteristic relations will hold true of the products of any 
of the quantities. 

Thus if the value of a product is independent of the order of the 
factors when these belong to the system of units, it will always be 
independent of the order of the factors. The kind of multiplication 
characterized by this relation and no other between the products is 
called by Grassmann algebraic, because its rules coincide with those 
of ordinary algebra. It is to be observed, however, that it gives 
rise to multiple quantities of higher orders. If n independent units 
are required to express the original quantities, \(\frac{n(n+1)}{2}\) will be 
required for the products of two factors, \(\frac{n(n+1)(n+2)}{2 \cdot 3}\) for the 
products of three factors, etc. 

Again, if the value of a product of factors belonging to a system 
of units is multiplied by \(-1\) when two factors change places, the 
same will be true of the product of any factors obtained by addition 
of the units. The kind of multiplication characterized by this relation 
and no other is called by Grassmann external or combinatorial. For 
our present purpose we may denote it by the sign \(\times\). It gives rise 
to multiple quantities of higher orders, \(\frac{n(n-1)}{2}\) units being required 
to express the products of two factors, \(\frac{n(n-1)(n-2)}{2 \cdot 3}\) units for 
products of three factors, etc. All products of more than n factors 
are zero. The products of n factors may be expressed by a single 
unit, viz., the product of the n original units taken in a specified 
order, which is generally set equal to 1. The products of n - 1 factors 
are expressed in terms of n units, those of n - 2 factors in terms of 
\(\frac{n(n-1)}{2}\) units, etc. This kind of multiplication is associative, like the 
algebraic. 

Grassmann observes, with respect to binary products, that these 
two kinds of multiplication are the only kinds characterized by laws 
which are the same for any factors as for particular units, except 
indeed that characterized by no special laws, and that for which all 
products are zero.* The last we may evidently reject as nugatory. 
That for which there are no special laws, i.e., in which no equations 
subsist between the products of a system of independent units, is also 
rejected by Grassmann, as not appearing to afford important 
applications. I shall, however, have occasion to speak of it, and shall call 
it the indeterminate product. In this kind of multiplication, \(n^2\) units 
are required to express the products of two factors, and \(n^3\) units for 
products of three factors, etc. It evidently may be regarded as 
associative. 

* Crelle’s J. Math., vol. xlix, p. 138. 
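
The numbers of units required by the three kinds of multiplication can be tabulated directly. This is only a sketch; the function name `units_required` and its dictionary keys are assumptions for illustration. It uses the fact that the counts stated in the text are binomial coefficients.

```python
from math import comb


def units_required(n, k):
    """Units needed to express products of k factors of n-fold quantities."""
    return {
        # n(n+1)/2 for k = 2, n(n+1)(n+2)/(2*3) for k = 3, etc.
        "algebraic": comb(n + k - 1, k),
        # n(n-1)/2 for k = 2, n(n-1)(n-2)/(2*3) for k = 3; zero when k > n
        "combinatorial": comb(n, k),
        # no relations between the products of the units
        "indeterminate": n ** k,
    }


for k in (2, 3, 4, 5):
    print(k, units_required(4, k))
```

With n = 4 the combinatorial count falls to 1 for four factors and to 0 for five, which agrees with the statements that the product of n factors is expressed by a single unit and that all products of more than n factors are zero.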

Another very important kind of multiplication is that called by 
Grassmann internal. In the form in which I shall give it, which is 
less general than Grassmann’s, it is in one respect the most simple of 
all, since its only result is a numerical quantity. It is essentially 
binary and characterized by laws of the form 

\[ i \cdot i = 1, \quad j \cdot j = 1, \quad k \cdot k = 1, \quad \text{etc.}, \]

\[ i \cdot j = 0, \quad j \cdot i = 0, \quad \text{etc.}, \]

where i, j, k, etc., represent a system of independent units. I use the 
dot as significant of this kind of multiplication. 

Grassmann derives this kind of multiplication from the 
combinatorial by the following process. He defines the complement 
(Ergänzung) of a unit as the combinatorial product of all the other 
units, taken with such a sign that the combinatorial product of the 
unit and its complement shall be positive. The combinatorial product 
of a unit and its complement is therefore unity, and that of a unit 
and the complement of any other unit is zero. The internal product 
of two units is the combinatorial product of the first and the 
complement of the second. 

It is important to observe that any scalar product of two factors of 
the same kind of multiple quantities, which is positive when the 
factors are identical, may be regarded as an internal product, i.e., we 
may always find such a system of units, that the characteristic 
equations of the product will reduce to the above form. The nature 
of the subject may afford a definition of the product independent of 
any reference to a system of units. Such a definition will then have 
obvious advantages. An important case of this kind occurs in 
geometry in that product of two vectors which is obtained by 
multiplying the product of their lengths by the cosine of the angle which 
they include. This is an internal product in Grassmann’s sense. 

Let us now return to the indeterminate product, which I am 
inclined to regard as the most important of all, since we may derive 
from it the algebraic and the combinatorial. For this end we will 
prefix \(\Sigma\) to an indeterminate product to denote the sum of all the 
terms obtained by taking the factors in every possible order. Then, 

\[ \Sigma\, \alpha | \beta | \gamma, \]

for instance, where the vertical line is used to denote the 
indeterminate product,* is a distributive function of \(\alpha\), \(\beta\), and \(\gamma\). It 
is evidently not affected by changing the order of the letters. It is, 
therefore, an algebraic product in the sense in which the term has 
been defined. 

So, again, if we prefix \(\Sigma \pm\) to an indeterminate product to denote 
the sum of all terms obtained by giving the factors every possible 
order, those terms being taken negatively which are obtained by an 
odd number of simple permutations, 

\[ \Sigma \pm\, \alpha | \beta | \gamma, \]

for instance, will be a distributive function of \(\alpha\), \(\beta\), \(\gamma\), which is 
multiplied by \(-1\) when two of these letters change places. It will 
therefore be a combinatorial product. 
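
The derivation of the algebraic and combinatorial products from the indeterminate may be sketched in code. An indeterminate product of k quantities is stored here as one coefficient for each ordered set of k unit indices; this representation and all the names are assumptions of the sketch, not notation from the text.

```python
from itertools import permutations, product


def indeterminate(*vectors):
    """Indeterminate product: one coefficient per ordered tuple of unit indices."""
    n = len(vectors[0])
    out = {}
    for idx in product(range(n), repeat=len(vectors)):
        coeff = 1
        for v, i in zip(vectors, idx):
            coeff *= v[i]
        out[idx] = coeff
    return out


def sign(perm):
    """+1 for an even permutation, -1 for an odd one (parity of inversions)."""
    inversions = sum(
        1
        for i in range(len(perm))
        for j in range(i + 1, len(perm))
        if perm[i] > perm[j]
    )
    return -1 if inversions % 2 else 1


def ordered_sum(vectors, signed=False):
    """Sum of the indeterminate products of the factors taken in every order.

    With signed=True, terms reached by an odd permutation are taken negatively.
    """
    total = {}
    for perm in permutations(range(len(vectors))):
        s = sign(perm) if signed else 1
        for idx, c in indeterminate(*(vectors[p] for p in perm)).items():
            total[idx] = total.get(idx, 0) + s * c
    return total


a, b, c = (1, 2, 0), (0, 1, 3), (2, 0, 1)

symmetric = ordered_sum([a, b, c])
alternating = ordered_sum([a, b, c], signed=True)

# Unaffected by the order of the letters: an algebraic product.
print(symmetric == ordered_sum([b, a, c]))
# Multiplied by -1 when two letters change places: a combinatorial product.
swapped = ordered_sum([b, a, c], signed=True)
print(all(alternating[k] == -swapped[k] for k in alternating))
```

Both checks print `True` for any choice of the three quantities, since each sum runs over every order of the factors.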

It is a characteristic and very important property of an 
indeterminate product that every product of all its factors with any other 
quantities is also a product of the indeterminate product and the 
other quantities. We need not stop for a formal proof of this 
proposition, which indeed is an immediate consequence of the definitions 
of the terms. 

These considerations bring us naturally to what Grassmann calls 
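regressive multiplication, which I will first illustrate by a very 
simple example. If n, the degree of multiplicity of our original 
quantities, is 4, the combinatorial product of \(\alpha \times \beta \times \gamma\) and \(\delta \times \epsilon\), viz., 

\[ \alpha \times \beta \times \gamma \times \delta \times \epsilon, \]

is necessarily zero, since the number of factors exceeds four. But if 
for \(\delta \times \epsilon\) we set its equivalent 

\[ \delta | \epsilon - \epsilon | \delta, \]

we may multiply the first factor in each of these indeterminate 
products combinatorially by \(\alpha \times \beta \times \gamma\), and prefix the result, which is a 
numerical quantity, as coefficient to the second factor. This will give 

\[ (\alpha \times \beta \times \gamma \times \delta)\, \epsilon - (\alpha \times \beta \times \gamma \times \epsilon)\, \delta. \]

Now, the first term of this expression is a product of \(\alpha \times \beta \times \gamma\), \(\delta\), and 
\(\epsilon\), and therefore, by the principle just stated, a product of \(\alpha \times \beta \times \gamma\) 
and \(\delta | \epsilon\). The second term is a similar product of \(\alpha \times \beta \times \gamma\) and \(\epsilon | \delta\). 
Therefore the whole expression is a product of \(\alpha \times \beta \times \gamma\) and \(\delta | \epsilon - \epsilon | \delta\), 
that is, of \(\alpha \times \beta \times \gamma\) and \(\delta \times \epsilon\). That is, except in sign, what 
Grassmann calls the regressive product of \(\alpha \times \beta \times \gamma\) and \(\delta \times \epsilon\). 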
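
The example for n = 4 may be worked numerically. In this sketch the combinatorial product of four quantities is evaluated as a determinant (the product of the four units being set equal to 1); the function names and the particular quantities are assumptions chosen for illustration.

```python
from itertools import permutations


def combinatorial4(rows):
    """Combinatorial product of n quantities of an n-fold system, as a determinant."""
    n = len(rows)
    total = 0
    for perm in permutations(range(n)):
        inversions = sum(
            1 for i in range(n) for j in range(i + 1, n) if perm[i] > perm[j]
        )
        term = -1 if inversions % 2 else 1
        for r, c in enumerate(perm):
            term *= rows[r][c]
        total += term
    return total


def regressive(a, b, c, d, e):
    """(a x b x c x d) e - (a x b x c x e) d, the expression formed in the text."""
    k_d = combinatorial4([a, b, c, d])
    k_e = combinatorial4([a, b, c, e])
    return tuple(k_d * ei - k_e * di for di, ei in zip(d, e))


a, b, c = (1, 0, 0, 0), (0, 1, 0, 0), (0, 0, 1, 0)
d, e = (1, 2, 3, 4), (0, 1, 1, 2)

r = regressive(a, b, c, d, e)
print(r)                             # (-2, 0, -2, 0)
print(combinatorial4([a, b, c, r]))  # 0
```

The result is a combination of the last two quantities by construction, and its combinatorial product with the first three vanishes, so it belongs to both of the given products. Exchanging the last two quantities changes its sign, as a function of their combinatorial product should.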

* This notation must not be confounded with Grassmann’s use of the vertical line. 

To generalize this process, we first observe that an expression of 
the form 

\[ \Sigma \pm\, \alpha \times \beta \,|\, \gamma \times \delta, \]

in which each term is an indeterminate product of two combinatorial 
products, and in which \(\Sigma \pm\) denotes the sum of all terms obtained by 
putting every different pair of the letters before the dividing line, the 
negative sign being used for any terms which may be obtained by an 
odd number of simple permutations of the letters, in other words, 
the expression 

\[ \alpha \times \beta | \gamma \times \delta - \alpha \times \gamma | \beta \times \delta - \alpha \times \delta | \gamma \times \beta + \beta \times \gamma | \alpha \times \delta - \beta \times \delta | \alpha \times \gamma + \gamma \times \delta | \alpha \times \beta, \]

is a distributive function of \(\alpha\), \(\beta\), \(\gamma\), and \(\delta\), which is multiplied by \(-1\) 
when two of these letters change places, and may, therefore, be regarded 
as equivalent to the combinatorial product \(\alpha \times \beta \times \gamma \times \delta\). Now, if n = 5, 
the combinatorial product of 

\[ \rho \times \sigma \times \tau \quad \text{and} \quad \alpha \times \beta \times \gamma \times \delta \]

is zero. But if we multiply the first member of each of the above 
indeterminate products by \(\rho \times \sigma \times \tau\), and prefix the result as 
coefficient to the second member, we obtain 

\[ (\rho \times \sigma \times \tau \times \alpha \times \beta)\, \gamma \times \delta - (\rho \times \sigma \times \tau \times \alpha \times \gamma)\, \beta \times \delta + \text{etc.}, \]

which is what Grassmann calls the regressive product of \(\rho \times \sigma \times \tau\) and 
\(\alpha \times \beta \times \gamma \times \delta\). It is easy to see that the principle may be extended so 
as to give a regressive product in any case in which the total number 
of factors of two combinatorial products is greater than n. Also, that 
we might form a regressive product by treating the first of the given 
combinatorials as we have treated the second. It may easily be shown 
that this would give the same result, except in some cases with a 
difference of sign. To avoid this inconvenience, we may make the 
rule, that whenever in the substitution of a sum of indeterminate 
products for a combinatorial, both factors of the indeterminate products 
are of odd degree, we change the sign of the whole expression. With 
this understanding, the results which we obtain will be identical with 
Grassmann’s regressive product. The propriety of the name consists 
in the fact that the product is of less degree than either of the factors. 
For the contrary reason, the ordinary external or combinatorial 
multiplication is sometimes called by Grassmann progressive. 

Regressive multiplication is associative and exhibits a very 
remarkable analogy with the progressive. This analogy I have not time 
here to develop, but will only remark that in this analogy lies in its 
most general form that celebrated principle of duality, which appears 
in various forms in geometry and certain branches of analysis. 

To fix our ideas, I may observe that in geometry the progressive 
multiplication of points gives successively lines, planes and volumes; 
the regressive multiplication of planes gives successively lines, points 
and scalar quantities. 

The indeterminate product affords a natural key to the subject of 
matrices. In fact, a sum of indeterminate products of the second 
degree represents \(n^2\) scalars, which constitute an ordinary or quadratic 
matrix; a sum of indeterminate products of the third degree 
represents \(n^3\) scalars, which constitute a cubic matrix, etc. I shall confine 
myself to the simplest and most important case, that of quadratic 
matrices. 

An expression of the form 

\[ \alpha(\lambda \cdot \rho) \]

being a product of \(\alpha\), \(\lambda\), and \(\rho\), may be regarded as a product of \(\alpha | \lambda\) 
and \(\rho\), by a principle already stated. Now if \(\Phi\) denotes a sum of 
indeterminate products, of second degree, say \(\alpha | \lambda + \beta | \mu + \text{etc.}\), we 
may write 

\[ \Phi \cdot \rho \]

for 

\[ \alpha(\lambda \cdot \rho) + \beta(\mu \cdot \rho) + \text{etc.} \]

This is like \(\rho\), a quantity of the first degree, and it is a homogeneous 
linear function of \(\rho\). It is easy to see that the most general form 
of such a function may be expressed in this way. An equation like 

\[ \sigma = \Phi \cdot \rho \]

represents n equations in ordinary algebra, in which n variables are 
expressed as linear functions of n others by means of \(n^2\) coefficients. 

The internal product of two indeterminate products may be defined 
by the equation 

\[ (\alpha | \beta) \cdot (\gamma | \delta) = (\beta \cdot \gamma)\, \alpha | \delta. \]

This defines the internal product of matrices, as 

\[ \Phi \cdot \Psi. \]

This product evidently gives a matrix, the operation of which is 
equivalent to the successive operations of \(\Psi\) and \(\Phi\); i.e., 

\[ (\Phi \cdot \Psi) \cdot \rho = \Phi \cdot (\Psi \cdot \rho). \]

We may express this a little more generally by saying that internal 
multiplication is associative when performed on a series of matrices, 
or on such a series terminated by a quantity of the first degree. 
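
The statements of the last few paragraphs can be tried on numbers. In the sketch below a binary indeterminate product is stored as the square array of products of components, and a sum of such products is the entrywise sum of the arrays. The representation and the names (`dyad`, `on_vector`, `internal`) are assumptions made for illustration.

```python
def dot(u, v):
    """Internal product of two quantities of the first degree."""
    return sum(a * b for a, b in zip(u, v))


def dyad(a, l):
    """Array of the binary indeterminate product a|l: entry (i, j) is a_i * l_j."""
    return tuple(tuple(ai * lj for lj in l) for ai in a)


def add(p, q):
    return tuple(tuple(x + y for x, y in zip(rp, rq)) for rp, rq in zip(p, q))


def on_vector(phi, rho):
    """Internal product of a matrix and a quantity of the first degree."""
    return tuple(dot(row, rho) for row in phi)


def internal(phi, psi):
    """Internal product of two matrices."""
    cols = list(zip(*psi))
    return tuple(tuple(dot(row, col) for col in cols) for row in phi)


alpha, lam = (1, 0, 2), (3, 1, 0)
beta, mu = (0, 1, 1), (2, 0, 5)
rho = (1, 2, 3)

phi = add(dyad(alpha, lam), dyad(beta, mu))
psi = add(dyad(lam, beta), dyad(mu, alpha))

# The matrix applied to rho agrees with alpha(lam.rho) + beta(mu.rho).
expected = tuple(
    x * dot(lam, rho) + y * dot(mu, rho) for x, y in zip(alpha, beta)
)
print(on_vector(phi, rho) == expected)

# (a|b).(c|d) = (b.c) a|d
lhs = internal(dyad(alpha, lam), dyad(beta, mu))
rhs = tuple(tuple(dot(lam, beta) * v for v in row) for row in dyad(alpha, mu))
print(lhs == rhs)

# The product of two matrices acts as the second followed by the first.
print(on_vector(internal(phi, psi), rho) == on_vector(phi, on_vector(psi, rho)))
```

All three comparisons print `True`; the second matrix `psi` is built from the same four quantities only to keep the example short.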

Another kind of multiplication of binary indeterminate products 
is that in which the preceding factors are multiplied combinatorially, 
and also the following. It may be defined by the equation 

$$(\alpha|\lambda) \overset{\times}{\times} (\beta|\mu) \overset{\times}{\times} (\gamma|\nu) = \alpha\times\beta\times\gamma \,|\, \lambda\times\mu\times\nu.$$

This defines a multiplication of matrices denoted by the same symbol. 

This multiplication, which is associative and commutative, is of great 
importance in the theory of determinants. In fact, 

$$\frac{1}{n!}\,\Phi^{n}$$

(the power being taken in the sense of this multiplication) 
is the determinant of the matrix $\Phi$. A lower power, as the $m$th, with 
the divisor $n(n-1)\ldots(n-m+1)$ would express as multiple quantity 
all the subdeterminants of order $m$.* 

It is evident that by the combination of the operations of indeterminate, 
algebraic, and combinatorial multiplication we obtain 
multiple quantities of a more complicated nature than by the use 
of only one of these kinds of multiplication. The indeterminate 
product of combinatorial products we have already mentioned. The 
combinatorial product of algebraic products, and the indeterminate 
product of algebraic products, are also of great importance, especially 
in the theory of quantics. These three multiplications, with the 
internal, especially in connection with the general property of the 
indeterminate product given above, and the derivation of the algebraic 
and combinatorial products from the indeterminate, which affords a 
generalization of that property, give rise to a great wealth of multiplicative 
relations between these multiple quantities. I say “wealth 
of multiplicative relations” designedly, for there is hardly any kind 
of relations between things which are the objects of mathematical 
study, which add so much to the resources of the student as those 
which we call multiplicative, except perhaps the simpler class which 
we call additive, and which are presupposed in the multiplicative. 
This is a truth quite independent of our using any of the notations 
of multiple algebra, although a suitable notation for such relations 
will of course increase their value. 

Perhaps, before closing, I ought to say a few words on the applications 
of multiple algebra. 

First of all, geometry, and the geometrical sciences which treat of 
things having position in space, kinematics, mechanics, astronomy, 
physics, crystallography, seem to demand a method of this kind, for 
position in space is essentially a multiple quantity and can only be 
represented by simple quantities in an arbitrary and cumbersome 
manner. For this reason, and because our spatial intuitions are more 
developed than those of any other class of mathematical relations, 
these subjects are especially adapted to introduce the student to the 
methods of multiple algebra. Here, Nature herself takes us by the 
hand and leads us along by easy steps, as a mother teaches her child 
to walk. In the contemplation of such subjects, Möbius, Hamilton, 
and Grassmann formed their algebras, although the philosophical 
mind of the last was not satisfied until he had produced a system 
unfettered by any spatial relations. It is probably in connection with 
some of these subjects that the notions of multiple algebra are most 
widely disseminated. 

* Quadratic matrices may also be represented by a sum of indeterminate products of 
a quantity of the first degree with a combinatorial product of $(n-1)$st degree, as, for 
example, when $n = 4$, by a sum of products of the form 

$$\alpha\,|\,\beta\times\gamma\times\delta.$$

The theory of such matrices is almost identical with that of those of the other form, 
except that the external multiplication takes the place of the internal, in the multiplication 
of the matrices with each other and with quantities of the first degree. 

Maxwell’s Treatise on Electricity and Magnetism has done so 
much to familiarize students of physics with quaternion notations, 
that it seems impossible that this subject should ever again be entirely 
divorced from the methods of multiple algebra. 

I wish that I could say as much of astronomy. It is, I think, to be 
regretted, that the oldest of the scientific applications of mathematics, 
the most dignified, the most conservative, should keep so far aloof 
from the youngest of mathematical methods; and standing as I do 
to-day, by some chance, among astronomers, although not of the guild, 
I cannot but endeavor to improve the opportunity by expressing my 
conviction of the advantages which astronomers might gain by 
employing some of the methods of multiple algebra. A very few of 
the fundamental notions of a vector analysis, the addition of vectors 
and what quaternionists would call the scalar part and the vector 
part of the product of two vectors (which may be defined without 
the notion of the quaternion), these three notions with some four 
fundamental properties relating to them are sufficient to reduce 
enormously the labor of mastering such subjects as the elementary 
theory of orbits, the determination of an orbit from three observations, 
the differential equations which are used in determining the best orbit 
from an indefinite number of observations by the method of least 
squares, or those which give the perturbations when the elements are 
treated as variable. In all these subjects the analytical work is 
greatly simplified, and it is far easier to find the best form for 
numerical calculation than by the use of the ordinary analysis. 

I may here remark that in its geometrical applications multiple 
algebra will naturally take one of two principal forms, according as 
vectors or points are taken as elementary quantities, i.e., according as 
something having magnitude and direction, or something having 
magnitude and position at a point, is the fundamental conception. 
These forms of multiple algebra may be distinguished as vector 
analysis and point analysis. The former we may call a triple, the 
latter a quadruple algebra, if we determine the degree of the algebra 
from the degree of multiplicity of the fundamental conception. The 
former is included in the latter, since the subtraction of points gives 
us vectors, and in this way Grassmann’s vector analysis is included in 
his point analysis. Hamilton’s system, in which the vector is the 
fundamental idea, is nevertheless made a quadruple algebra by the 
addition of ordinary numerical quantities. For practical purposes we 
may regard Hamilton’s system as equivalent to Grassmann’s algebra 
of vectors. Such practical equivalence is of course consistent with 
great differences of notation, and of the point of view from which the 
subject is regarded. 

Perhaps I should add a word in regard to the nature of the problems 
which require a vector analysis, or the more general form of Grassmann’s 
point analysis. The distinction of the problems is very marked, 
and corresponds precisely to the distinction familiar to all analysts 
between problems which are suitable for Cartesian coordinates, and 
those which are suitable for the use of tetrahedral, or, in plane 
geometry, triangular coordinates. Thus, in mechanics, kinematics, 
astronomy, physics, or crystallography, Grassmann’s point analysis 
will rarely be wanted. One might teach these subjects for years by a 
vector analysis, and never perhaps feel the need of any of the notions 
or notations which are peculiar to the point analysis, precisely as in 
ordinary algebra one might use the Cartesian coordinates in teaching 
these subjects, without any occasion for the use of tetrahedral coordinates. 
I think of one exception, which, however, confirms the 
rule. The very important theory of forces acting on a rigid body is 
much better treated by point analysis than by vector analysis, exactly 
as in ordinary algebra it is much better treated by tetrahedral coordinates 
than by Cartesian; I mean for the purpose of the elegant 
development of general propositions. A sufficient theory for the 
purposes of numerical calculations can easily enough be given by any 
method, and the most familiar to the student is for such practical 
purposes of course the best. On the other hand, the projective properties 
of bodies, the relations of collinearity, and similar subjects, 
seem to demand the point analysis for their adequate treatment. 

If I have said that the algebra of vectors is contained in the algebra 
of points, it does not follow that in a certain sense the algebra of 
points is not deducible from the algebra of vectors. In mathematics, 
a part often contains the whole. If we represent points by vectors 
drawn from a common origin, and then develop those relations 
between such vectors representing points, which are independent of 
the position of the origin,—by this simple process we may obtain a 
large part, possibly all, of an algebra of points. In this way the 
vector analysis may be made to serve very conveniently for many of 
those subjects which I have mentioned as suitable for point analysis. 
The vector analysis, thus enlarged, is hardly to be distinguished from 
a point analysis, but the treatment of the subject in this way has 
somewhat of a makeshift character, as distinguished from the unity 
and simplicity of the subject when developed directly from the idea of 
something situated at a point. 

Of those subjects which have no relations to space, the elementary 
theory of eliminations and substitutions, including the theory of 
matrices and determinants, seems to afford the most simple application 
of multiple algebra. I have already indicated what seems to me the 
appropriate foundation for the theory of matrices. The method is 
essentially that which Grassmann has sketched in his first Ausdehnungslehre 
under the name of the open product and has developed 
at length in the second. 

In the theory of quantics Grassmann’s algebraic product finds an 
application, the quantic appearing as a sum of algebraic products in 
Grassmann’s sense of the term. As it has been stated that these 
products are subject to the same laws as the ordinary products of 
algebra, it may seem that we have here a distinction without an 
important difference. If the quantics were to be subject to no farther 
multiplications, except the algebraic in Grassmann’s sense, such an 
objection would be valid. But quantics regarded as sums of algebraic 
products, in Grassmann’s sense, are multiple quantities and subject to 
a great variety of other multiplications than the algebraic, by which 
they were formed. Of these the most important are doubtless the 
combinatorial, the internal, and the indeterminate. The combinatorial 
and the internal may be applied, not only to the quantic as a whole or 
to the algebraic products of which it consists, but also to the individual 
factors in each term, in accordance with the general principle 
which has been stated with respect to the indeterminate product and 
which will apply also to the algebraic, since the algebraic may be 
regarded as a sum of indeterminate products. 

In the differential and integral calculus it is often advantageous 
to regard as multiple quantities various sets of variables, especially 
the independent variables, or those which may be taken as such. 
It is often convenient to represent in the form of a single differential 
coefficient, as 

$$\frac{d\tau}{d\rho},$$

a block or matrix of ordinary differential coefficients. In this 
expression, $\rho$ may be a multiple quantity representing say $n$ independent 
variables, and $\tau$ another representing perhaps the same 
number of dependent variables. Then $d\rho$ represents the $n$ differentials 
of the former, and $d\tau$ the $n$ differentials of the latter. The 
whole expression represents an operator which turns $d\rho$ into $d\tau$, 
so that we may write identically 

$$d\tau = \frac{d\tau}{d\rho}\,d\rho.$$

Here we see a matrix of differential coefficients represented by 
a quotient. This conception is due to Grassmann, as well as the 
representation of the matrix by a sum of products, which we have 
already considered. It is to be observed that these multiple differential 
coefficients are subject to algebraic laws very similar to those 
which relate to ordinary differential coefficients when there is a 
single independent variable, e.g., 

$$\frac{d\sigma}{d\tau}\,\frac{d\tau}{d\rho} = \frac{d\sigma}{d\rho}, \qquad \frac{d\rho}{d\tau}\,\frac{d\tau}{d\rho} = 1.$$
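
The two laws just written for multiple differential coefficients can be checked numerically in a small case. The sketch below is only an illustration under stated assumptions: the matrix of differential coefficients is approximated by central differences, and the particular functions (polar to rectangular coordinates for $\tau(\rho)$, and an arbitrary $\sigma(\tau)$) are hypothetical choices, not taken from the address.

```python
import math


def jacobian(f, x, h=1e-6):
    """Central-difference matrix whose (i, j) entry approximates df_i/dx_j."""
    columns = []
    for j in range(len(x)):
        xp, xm = list(x), list(x)
        xp[j] += h
        xm[j] -= h
        columns.append([(a - b) / (2.0 * h) for a, b in zip(f(xp), f(xm))])
    rows = len(columns[0])
    return [[columns[j][i] for j in range(len(x))] for i in range(rows)]


def matmul(a, b):
    """Product of two matrices given as lists of rows."""
    return [[sum(a[i][k] * b[k][j] for k in range(len(b)))
             for j in range(len(b[0]))] for i in range(len(a))]


def tau_of_rho(rho):
    r, theta = rho
    return [r * math.cos(theta), r * math.sin(theta)]


def rho_of_tau(tau):
    x, y = tau
    return [math.hypot(x, y), math.atan2(y, x)]


def sigma_of_tau(tau):
    x, y = tau
    return [x * y, x + y]


rho0 = [2.0, 0.5]
tau0 = tau_of_rho(rho0)

# (d sigma / d tau)(d tau / d rho) compared with d sigma / d rho.
chain_lhs = matmul(jacobian(sigma_of_tau, tau0), jacobian(tau_of_rho, rho0))
chain_rhs = jacobian(lambda r: sigma_of_tau(tau_of_rho(r)), rho0)

# (d rho / d tau)(d tau / d rho), which should be the identical operator.
identity = matmul(jacobian(rho_of_tau, tau0), jacobian(tau_of_rho, rho0))
```

Within the accuracy of the finite differences, `chain_lhs` equals `chain_rhs` and `identity` is the unit matrix.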

In the integral calculus, the transformation of multiple integrals 
by change of variables is made very simple and clear by the methods 
of multiple algebra. 

In the geometrical applications of the calculus, there is a certain 
class of theorems, of which Green’s and Poisson’s are the most 
notable examples, which seem to have been first noticed in connection 
with certain physical theories, especially those of electricity and 
magnetism, and which have only recently begun to find their way 
into treatises on the calculus. These not only find simplicity of 
expression and demonstration in the infinitesimal calculus of multiple 
quantities, but also their natural position, which they hardly seem 
to find in the ordinary treatises. 

But I do not so much desire to call your attention to the diversity 
of the applications of multiple algebra, as to the simplicity and 
unity of its principles. The student of multiple algebra suddenly 
finds himself freed from various restrictions to which he has been 
accustomed. To many, doubtless, this liberty seems like an invitation 
to license. Here is a boundless field in which caprice may 
riot. It is not strange if some look with distrust for the result 
of such an experiment. But the farther we advance, the more 
evident it becomes that this too is a realm subject to law. The 
more we study the subject, the more we find all that is most useful 
and beautiful attaching itself to a few central principles. We 
begin by studying multiple algebras; we end, I think, by studying 
MULTIPLE ALGEBRA. 

ON THE DETERMINATION OF ELLIPTIC ORBITS FROM 
THREE COMPLETE OBSERVATIONS. 

[Memoirs of the National Academy of Sciences, vol. iv. part ii. 
pp. 79-104, 1889.] 

The determination of an orbit from three complete observations 
by the solution of the equations which represent elliptic motion 
presents so great difficulties in the general case, that in the first 
solution of the problem we must generally limit ourselves to the case 
in which the intervals between the observations are not very long. In 
this case we substitute some comparatively simple relations between 
the unknown quantities of the problem, which have an approximate 
validity for short intervals, for the less manageable relations which 
rigorously subsist between these quantities. A comparison of the 
approximate solution thus obtained with the exact laws of elliptic 
motion will always afford the means of a closer approximation, and 
by a repetition of this process we may arrive at any required degree 
of accuracy. 

It is therefore a problem not without interest—it is, in fact, the 
natural point of departure in the study of the determination of orbits 
—to express in a manner combining as far as possible simplicity and 
accuracy the relations between three positions in an orbit separated 
by small or moderate intervals. The problem is not entirely determinate, 
for we may lay the greater stress upon simplicity or upon 
accuracy; we may seek the most simple relations which are sufficiently 
accurate to give us any approximation to an orbit, or we may seek 
the most exact expression of the real relations, which shall not be too 
complex to be serviceable. 

Derivation of the Fundamental Equation. 

The following very simple considerations afford a vector equation, 
not very complex and quite amenable to analytical transformation, 
which expresses the relations between three positions in an orbit 
separated by small or moderate intervals, with an accuracy far 
exceeding that of the approximate relations generally used in the 
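determination of orbits. 

If we adopt such a unit of time that the acceleration due to the 
sun’s action is unity at a unit’s distance, and denote the vectors* 
drawn from the sun to the body in its three positions by $\mathfrak{R}_1$, $\mathfrak{R}_2$, $\mathfrak{R}_3$, 
and the lengths of these vectors (the heliocentric distances) by 
$r_1$, $r_2$, $r_3$, the accelerations corresponding to the three positions will be 
represented by 

$$-\frac{\mathfrak{R}_1}{r_1^3}, \qquad -\frac{\mathfrak{R}_2}{r_2^3}, \qquad -\frac{\mathfrak{R}_3}{r_3^3}.$$

Now the motion between the positions considered may be expressed with a 
high degree of accuracy by an equation of the form 

$$\mathfrak{R} = \mathfrak{A} + t\,\mathfrak{B} + t^2\,\mathfrak{C} + t^3\,\mathfrak{D} + t^4\,\mathfrak{E},$$

having five vector constants. The actual motion rigorously satisfies 
six conditions, viz., if we write $\tau_3$ for the interval of time between the 
first and second positions, and $\tau_1$ for that between the second and 
third, and set $t = 0$ for the second position, 

$$\begin{aligned}
\text{for } t = -\tau_3, &\qquad \mathfrak{R} = \mathfrak{R}_1, \qquad \frac{d^2\mathfrak{R}}{dt^2} = -\frac{\mathfrak{R}_1}{r_1^3};\\
\text{for } t = 0, &\qquad \mathfrak{R} = \mathfrak{R}_2, \qquad \frac{d^2\mathfrak{R}}{dt^2} = -\frac{\mathfrak{R}_2}{r_2^3};\\
\text{for } t = \tau_1, &\qquad \mathfrak{R} = \mathfrak{R}_3, \qquad \frac{d^2\mathfrak{R}}{dt^2} = -\frac{\mathfrak{R}_3}{r_3^3}.
\end{aligned}$$

* Vectors, or directed quantities, will be represented in this paper by German capitals. 
The following notations will be used in connection with them: 

The sign $=$ denotes identity in direction as well as length. 

The sign $+$ denotes geometrical addition, or what is called composition in mechanics. 

The sign $-$ denotes reversal of direction, or composition after reversal. 

The notation $\mathfrak{A}.\mathfrak{B}$ denotes the product of the lengths of the vectors and the cosine of 
the angle which they include. It will be called the direct product of $\mathfrak{A}$ and $\mathfrak{B}$. If 
$x$, $y$, $z$ are the rectangular components of $\mathfrak{A}$, and $x'$, $y'$, $z'$ those of $\mathfrak{B}$, 

$$\mathfrak{A}.\mathfrak{B} = xx' + yy' + zz'.$$

$\mathfrak{A}.\mathfrak{A}$ may be written $\mathfrak{A}^2$ and called the square of $\mathfrak{A}$. 

The notation $\mathfrak{A}\times\mathfrak{B}$ will be used to denote a vector of which the length is the product 
of the lengths of $\mathfrak{A}$ and $\mathfrak{B}$ and the sine of the angle which they include. Its direction 
is perpendicular to $\mathfrak{A}$ and $\mathfrak{B}$, and on that side on which a rotation from $\mathfrak{A}$ to $\mathfrak{B}$ 
appears counter-clockwise. It will be called the skew product of $\mathfrak{A}$ and $\mathfrak{B}$. If the 
rectangular components of $\mathfrak{A}$ and $\mathfrak{B}$ are $x$, $y$, $z$, and $x'$, $y'$, $z'$, those of $\mathfrak{A}\times\mathfrak{B}$ will be 

$$yz' - zy', \qquad zx' - xz', \qquad xy' - yx'.$$

The notation $(\mathfrak{A}\mathfrak{B}\mathfrak{C})$ denotes the volume of the parallelepiped of which three edges are 
obtained by laying off the vectors $\mathfrak{A}$, $\mathfrak{B}$, and $\mathfrak{C}$ from any same point, which volume is to 
be taken positively or negatively, according as the vector $\mathfrak{C}$ falls on the side of the plane 
containing $\mathfrak{A}$ and $\mathfrak{B}$, on which a rotation from $\mathfrak{A}$ to $\mathfrak{B}$ appears counter-clockwise, or on 
the other side. If the rectangular components of $\mathfrak{A}$, $\mathfrak{B}$, and $\mathfrak{C}$ are $x$, $y$, $z$; $x'$, $y'$, $z'$; 
and $x''$, $y''$, $z''$, 

$$(\mathfrak{A}\mathfrak{B}\mathfrak{C}) = \begin{vmatrix} x & y & z \\ x' & y' & z' \\ x'' & y'' & z'' \end{vmatrix}.$$

It follows, from the above definitions, that for any vectors $\mathfrak{A}$, $\mathfrak{B}$, and $\mathfrak{C}$ 

$$\mathfrak{A}.\mathfrak{B} = \mathfrak{B}.\mathfrak{A}, \qquad \mathfrak{A}\times\mathfrak{B} = -\mathfrak{B}\times\mathfrak{A},$$

$$(\mathfrak{A}\mathfrak{B}\mathfrak{C}) = (\mathfrak{B}\mathfrak{C}\mathfrak{A}) = (\mathfrak{C}\mathfrak{A}\mathfrak{B}) = -(\mathfrak{A}\mathfrak{C}\mathfrak{B}) = -(\mathfrak{C}\mathfrak{B}\mathfrak{A}) = -(\mathfrak{B}\mathfrak{A}\mathfrak{C}),$$

and 

$$(\mathfrak{A}\mathfrak{B}\mathfrak{C}) = \mathfrak{A}.(\mathfrak{B}\times\mathfrak{C}) = \mathfrak{B}.(\mathfrak{C}\times\mathfrak{A}) = \mathfrak{C}.(\mathfrak{A}\times\mathfrak{B});$$

also that $\mathfrak{A}.\mathfrak{B}$, $\mathfrak{A}\times\mathfrak{B}$ are distributive functions of $\mathfrak{A}$ and $\mathfrak{B}$, and $(\mathfrak{A}\mathfrak{B}\mathfrak{C})$ a distributive 
function of $\mathfrak{A}$, $\mathfrak{B}$, and $\mathfrak{C}$, for example, that if $\mathfrak{A} = \mathfrak{L} + \mathfrak{M}$, 

$$\mathfrak{A}.\mathfrak{B} = \mathfrak{L}.\mathfrak{B} + \mathfrak{M}.\mathfrak{B}, \qquad \mathfrak{A}\times\mathfrak{B} = \mathfrak{L}\times\mathfrak{B} + \mathfrak{M}\times\mathfrak{B}, \qquad (\mathfrak{A}\mathfrak{B}\mathfrak{C}) = (\mathfrak{L}\mathfrak{B}\mathfrak{C}) + (\mathfrak{M}\mathfrak{B}\mathfrak{C}),$$

and so for $\mathfrak{B}$ and $\mathfrak{C}$. 

The notation $(\mathfrak{A}\mathfrak{B}\mathfrak{C})$ is identical with that of Lagrange in the Mécanique Analytique, 
except that there its use is limited to unit vectors. The signification of $\mathfrak{A}\times\mathfrak{B}$ is closely 
related to, but not identical with, that of the notation $[r_1 r_2]$ commonly used to denote 
the double area of a triangle determined by two positions in an orbit. 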
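
The component formulae given in the footnote for the direct product, the skew product, and the parallelepiped volume translate directly into code. The following minimal Python sketch is an illustration only; the function names `direct`, `skew`, and `triple` are assumptions of this sketch and do not occur in the memoir.

```python
def direct(a, b):
    """Direct product A.B = xx' + yy' + zz'."""
    return a[0] * b[0] + a[1] * b[1] + a[2] * b[2]


def skew(a, b):
    """Skew product AxB with components yz'-zy', zx'-xz', xy'-yx'."""
    return [a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0]]


def triple(a, b, c):
    """Volume (ABC) = A.(BxC), the determinant of the nine components."""
    return direct(a, skew(b, c))


# Hypothetical sample vectors.
A = [1.0, 2.0, 3.0]
B = [4.0, 5.0, 6.0]
C = [7.0, 8.0, 10.0]
```

For these sample vectors `direct(A, B)` is 32, `skew(A, B)` is `[-3, 6, -3]`, and `triple(A, B, C)` is -3; a cyclic change of the letters leaves the volume unaltered, while an interchange of two letters reverses its sign.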




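We may therefore write with a high degree of approximation 

$$\begin{aligned}
\mathfrak{R}_1 &= \mathfrak{A} - \tau_3\mathfrak{B} + \tau_3^2\mathfrak{C} - \tau_3^3\mathfrak{D} + \tau_3^4\mathfrak{E},\\
\mathfrak{R}_2 &= \mathfrak{A},\\
\mathfrak{R}_3 &= \mathfrak{A} + \tau_1\mathfrak{B} + \tau_1^2\mathfrak{C} + \tau_1^3\mathfrak{D} + \tau_1^4\mathfrak{E},\\
-\frac{\mathfrak{R}_1}{r_1^3} &= 2\mathfrak{C} - 6\tau_3\mathfrak{D} + 12\tau_3^2\mathfrak{E},\\
-\frac{\mathfrak{R}_2}{r_2^3} &= 2\mathfrak{C},\\
-\frac{\mathfrak{R}_3}{r_3^3} &= 2\mathfrak{C} + 6\tau_1\mathfrak{D} + 12\tau_1^2\mathfrak{E}.
\end{aligned}$$

From these six equations the five constants $\mathfrak{A}$, $\mathfrak{B}$, $\mathfrak{C}$, $\mathfrak{D}$, $\mathfrak{E}$ may be 
eliminated, leaving a single equation of the form 

$$A_1\left(1+\frac{B_1}{r_1^3}\right)\mathfrak{R}_1 - \left(1-\frac{B_2}{r_2^3}\right)\mathfrak{R}_2 + A_3\left(1+\frac{B_3}{r_3^3}\right)\mathfrak{R}_3 = 0, \qquad (1)$$

where 

$$A_1 = \frac{\tau_1}{\tau_1+\tau_3}, \qquad A_3 = \frac{\tau_3}{\tau_1+\tau_3},$$

$$B_1 = \tfrac{1}{12}(-\tau_1^2+\tau_1\tau_3+\tau_3^2), \qquad B_2 = \tfrac{1}{12}(\tau_1^2+3\tau_1\tau_3+\tau_3^2), \qquad B_3 = \tfrac{1}{12}(\tau_1^2+\tau_1\tau_3-\tau_3^2).$$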

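This we shall call our fundamental equation. In order to discuss 
its geometrical signification, let us set 

$$n_1 = A_1\left(1+\frac{B_1}{r_1^3}\right), \qquad n_2 = 1-\frac{B_2}{r_2^3}, \qquad n_3 = A_3\left(1+\frac{B_3}{r_3^3}\right), \qquad (2)$$

so that the equation will read 

$$n_1\mathfrak{R}_1 - n_2\mathfrak{R}_2 + n_3\mathfrak{R}_3 = 0. \qquad (3)$$

This expresses that the vector $n_2\mathfrak{R}_2$ is the diagonal of a parallelogram 
of which $n_1\mathfrak{R}_1$ and $n_3\mathfrak{R}_3$ are sides. If we multiply by $\mathfrak{R}_3$ and by $\mathfrak{R}_1$, 
in skew multiplication, we get 

$$n_1\mathfrak{R}_1\times\mathfrak{R}_3 - n_2\mathfrak{R}_2\times\mathfrak{R}_3 = 0, \qquad -n_2\mathfrak{R}_1\times\mathfrak{R}_2 + n_3\mathfrak{R}_1\times\mathfrak{R}_3 = 0, \qquad (4)$$

whence 

$$\frac{\mathfrak{R}_2\times\mathfrak{R}_3}{n_1} = \frac{\mathfrak{R}_1\times\mathfrak{R}_3}{n_2} = \frac{\mathfrak{R}_1\times\mathfrak{R}_2}{n_3}. \qquad (5)$$

Our equation may therefore be regarded as signifying that the three 
vectors $\mathfrak{R}_1$, $\mathfrak{R}_2$, $\mathfrak{R}_3$ lie in one plane, and that the three triangles 
determined each by a pair of these vectors, and usually denoted by 
$[r_2r_3]$, $[r_1r_3]$, $[r_1r_2]$, are proportional to 

$$A_1\left(1+\frac{B_1}{r_1^3}\right), \qquad \left(1-\frac{B_2}{r_2^3}\right), \qquad A_3\left(1+\frac{B_3}{r_3^3}\right).$$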
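
The accuracy claimed for the fundamental equation can be tried on a case where the true motion is known. The sketch below is an illustration under stated assumptions, not part of the memoir: it takes a circular orbit of unit radius (so that, in the unit of time adopted, the angular velocity is unity), places the three positions at $t = -\tau_3$, $0$, $\tau_1$, and evaluates both the residual of the equation and the three ratios of triangle to coefficient. The function names and the interval values are hypothetical.

```python
import math


def coefficients(tau1, tau3):
    """A1, A3, B1, B2, B3; tau3 separates positions 1 and 2, tau1 positions 2 and 3."""
    total = tau1 + tau3
    a1 = tau1 / total
    a3 = tau3 / total
    b1 = (-tau1 ** 2 + tau1 * tau3 + tau3 ** 2) / 12.0
    b2 = (tau1 ** 2 + 3.0 * tau1 * tau3 + tau3 ** 2) / 12.0
    b3 = (tau1 ** 2 + tau1 * tau3 - tau3 ** 2) / 12.0
    return a1, a3, b1, b2, b3


def n_factors(tau1, tau3, r1, r2, r3):
    """The three scalar coefficients of the heliocentric vectors."""
    a1, a3, b1, b2, b3 = coefficients(tau1, tau3)
    return (a1 * (1.0 + b1 / r1 ** 3),
            1.0 - b2 / r2 ** 3,
            a3 * (1.0 + b3 / r3 ** 3))


def residual(R1, R2, R3, tau1, tau3):
    """Left-hand member n1 R1 - n2 R2 + n3 R3 of the fundamental equation."""
    r1, r2, r3 = (math.sqrt(sum(c * c for c in R)) for R in (R1, R2, R3))
    n1, n2, n3 = n_factors(tau1, tau3, r1, r2, r3)
    return [n1 * a - n2 * b + n3 * c for a, b, c in zip(R1, R2, R3)]


def circular_position(t):
    """Unit circular orbit; angular velocity 1 in the adopted unit of time."""
    return [math.cos(t), math.sin(t), 0.0]


def cross_z(a, b):
    """Component of the skew product normal to the orbit plane."""
    return a[0] * b[1] - a[1] * b[0]


tau1, tau3 = 0.1, 0.2
R1 = circular_position(-tau3)
R2 = circular_position(0.0)
R3 = circular_position(tau1)

res = residual(R1, R2, R3, tau1, tau3)
n1, n2, n3 = n_factors(tau1, tau3, 1.0, 1.0, 1.0)
ratios = [cross_z(R2, R3) / n1, cross_z(R1, R3) / n2, cross_z(R1, R2) / n3]
```

For these intervals the components of `res` come out of the order of a millionth, and the three entries of `ratios` agree to about five significant figures, which illustrates the proportionality of the triangles to the three coefficients.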


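Since this vector equation is equivalent to three ordinary equations, 
it is evidently sufficient to determine the three positions of the body 
in connection with the conditions that these positions must lie upon 
the lines of sight of three observations. To give analytical expression 
to these conditions, we may write $\mathfrak{E}_1$, $\mathfrak{E}_2$, $\mathfrak{E}_3$ for the vectors drawn 
from the sun to the three positions of the earth (or, more exactly, of 
the observatories where the observations have been made), $\mathfrak{F}_1$, $\mathfrak{F}_2$, $\mathfrak{F}_3$ 
for unit vectors drawn in the directions of the body, as observed, 
and $\rho_1$, $\rho_2$, $\rho_3$ for the three distances of the body from the places of 
observation. We have then 

$$\mathfrak{R}_1 = \mathfrak{E}_1 + \rho_1\mathfrak{F}_1, \qquad \mathfrak{R}_2 = \mathfrak{E}_2 + \rho_2\mathfrak{F}_2, \qquad \mathfrak{R}_3 = \mathfrak{E}_3 + \rho_3\mathfrak{F}_3. \qquad (6)$$

By substitution of these values our fundamental equation becomes 

$$A_1\left(1+\frac{B_1}{r_1^3}\right)(\mathfrak{E}_1+\rho_1\mathfrak{F}_1) - \left(1-\frac{B_2}{r_2^3}\right)(\mathfrak{E}_2+\rho_2\mathfrak{F}_2) + A_3\left(1+\frac{B_3}{r_3^3}\right)(\mathfrak{E}_3+\rho_3\mathfrak{F}_3) = 0, \qquad (7)$$

where $\rho_1$, $\rho_2$, $\rho_3$, $r_1$, $r_2$, $r_3$ (the geocentric and heliocentric distances) 
are the only unknown quantities. From equations (6) we also get, 
by squaring both members in each, 

$$r_1^2 = \mathfrak{E}_1^2 + 2(\mathfrak{E}_1.\mathfrak{F}_1)\rho_1 + \rho_1^2, \qquad r_2^2 = \mathfrak{E}_2^2 + 2(\mathfrak{E}_2.\mathfrak{F}_2)\rho_2 + \rho_2^2, \qquad r_3^2 = \mathfrak{E}_3^2 + 2(\mathfrak{E}_3.\mathfrak{F}_3)\rho_3 + \rho_3^2, \qquad (8)$$

by which the values of $r_1$, $r_2$, $r_3$ may be derived from those of 
$\rho_1$, $\rho_2$, $\rho_3$, or vice versa. Equations (7) and (8), which are equivalent 
to six ordinary equations, are sufficient to determine the six quantities 
$r_1$, $r_2$, $r_3$, $\rho_1$, $\rho_2$, $\rho_3$; or, if we suppose the values of $r_1$, $r_2$, $r_3$ in terms 
of $\rho_1$, $\rho_2$, $\rho_3$ to be substituted in equation (7), we have a single vector 
equation, from which we may determine the three geocentric distances 
$\rho_1$, $\rho_2$, $\rho_3$. 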
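
The passage from a geocentric distance to the corresponding heliocentric position and distance is a small computation. The following minimal Python sketch is illustrative only: the names `position` and `heliocentric_distance` and the sample vectors are assumptions, with the earth's vector, the unit line-of-sight vector, and the geocentric distance passed as `E`, `F`, and `rho`.

```python
import math


def direct(a, b):
    """Direct (scalar) product of two vectors."""
    return sum(x * y for x, y in zip(a, b))


def position(E, F, rho):
    """Heliocentric vector R = E + rho F of the body on the line of sight."""
    return [e + rho * f for e, f in zip(E, F)]


def heliocentric_distance(E, F, rho):
    """r from r^2 = E.E + 2 (E.F) rho + rho^2, F being a unit vector."""
    return math.sqrt(direct(E, E) + 2.0 * direct(E, F) * rho + rho * rho)


# Hypothetical values: earth at unit distance, unit line-of-sight vector.
E = [1.0, 0.0, 0.0]
F = [0.6, 0.8, 0.0]
rho = 1.0

R = position(E, F, rho)
r = heliocentric_distance(E, F, rho)
```

Here `r` agrees with the length of `R` computed directly from its components, as it must if `F` is truly of unit length.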

It remains to be shown, first, how the numerical solution of the 
equation may be performed, and secondly, how such an approximate 
solution of the actual problem may furnish the basis of a closer 
approximation. 


Solution of the Fundamental Equation. 

The relations with which we have to do will be rendered a little 
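more simple if instead of each geocentric distance we introduce the 
distance of the body from the foot of the perpendicular from the sun 
upon the line of sight. If we set 

$$q_1 = \rho_1 + (\mathfrak{E}_1.\mathfrak{F}_1), \qquad q_2 = \rho_2 + (\mathfrak{E}_2.\mathfrak{F}_2), \qquad q_3 = \rho_3 + (\mathfrak{E}_3.\mathfrak{F}_3), \qquad (9)$$

$$p_1^2 = \mathfrak{E}_1^2 - (\mathfrak{E}_1.\mathfrak{F}_1)^2, \qquad p_2^2 = \mathfrak{E}_2^2 - (\mathfrak{E}_2.\mathfrak{F}_2)^2, \qquad p_3^2 = \mathfrak{E}_3^2 - (\mathfrak{E}_3.\mathfrak{F}_3)^2, \qquad (10)$$

equations (8) become 

$$r_1^2 = q_1^2 + p_1^2, \qquad r_2^2 = q_2^2 + p_2^2, \qquad r_3^2 = q_3^2 + p_3^2. \qquad (11)$$

Let us also set, for brevity, 

$$\begin{aligned}
\mathfrak{S}_1 &= A_1\left(1+\frac{B_1}{r_1^3}\right)(\mathfrak{E}_1+\rho_1\mathfrak{F}_1),\\
\mathfrak{S}_2 &= -\left(1-\frac{B_2}{r_2^3}\right)(\mathfrak{E}_2+\rho_2\mathfrak{F}_2),\\
\mathfrak{S}_3 &= A_3\left(1+\frac{B_3}{r_3^3}\right)(\mathfrak{E}_3+\rho_3\mathfrak{F}_3).
\end{aligned} \qquad (12)$$

Then $\mathfrak{S}_1$, $\mathfrak{S}_2$, $\mathfrak{S}_3$ may be regarded as functions respectively of 
$\rho_1$, $\rho_2$, $\rho_3$, therefore of $q_1$, $q_2$, $q_3$, and if we set 

$$\mathfrak{S}' = \frac{d\mathfrak{S}_1}{dq_1}, \qquad \mathfrak{S}'' = \frac{d\mathfrak{S}_2}{dq_2}, \qquad \mathfrak{S}''' = \frac{d\mathfrak{S}_3}{dq_3}, \qquad (13)$$

and 

$$\mathfrak{S} = \mathfrak{S}_1 + \mathfrak{S}_2 + \mathfrak{S}_3, \qquad (14)$$

we shall have 

$$d\mathfrak{S} = \mathfrak{S}'\,dq_1 + \mathfrak{S}''\,dq_2 + \mathfrak{S}'''\,dq_3. \qquad (15)$$

To determine the value of $\mathfrak{S}'$, we get by differentiation 

$$\mathfrak{S}' = A_1\left(1+\frac{B_1}{r_1^3}\right)\mathfrak{F}_1 + A_1B_1\,\frac{d(r_1^{-3})}{dq_1}\,(\mathfrak{E}_1+\rho_1\mathfrak{F}_1). \qquad (16)$$

But by (11) 

$$\frac{d(r_1^{-3})}{dq_1} = -\frac{3q_1}{r_1^5}. \qquad (17)$$

Therefore 

$$\mathfrak{S}' = A_1\left(1+\frac{B_1}{r_1^3}\right)\mathfrak{F}_1 - \frac{3B_1q_1}{r_1^2\,(r_1^3+B_1)}\,\mathfrak{S}_1. \qquad (18)$$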

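Now if any values of $q_1$, $q_2$, $q_3$ (either assumed or obtained by a 
previous approximation) give a certain residual $\mathfrak{S}$ (which would be 
zero if the values of $q_1$, $q_2$, $q_3$ satisfied the fundamental equation), and 
we wish to find the corrections $\Delta q_1$, $\Delta q_2$, $\Delta q_3$, which must be added 
to $q_1$, $q_2$, $q_3$ to reduce the residual to zero, we may apply equation (15) 
to these finite differences, and will have approximately, when these 
differences are not very large, 

$$-\mathfrak{S} = \mathfrak{S}'\Delta q_1 + \mathfrak{S}''\Delta q_2 + \mathfrak{S}'''\Delta q_3. \qquad (19)$$

This gives* 

$$\Delta q_1 = -\frac{(\mathfrak{S}\mathfrak{S}''\mathfrak{S}''')}{(\mathfrak{S}'\mathfrak{S}''\mathfrak{S}''')}, \qquad \Delta q_2 = -\frac{(\mathfrak{S}\mathfrak{S}'''\mathfrak{S}')}{(\mathfrak{S}'\mathfrak{S}''\mathfrak{S}''')}, \qquad \Delta q_3 = -\frac{(\mathfrak{S}\mathfrak{S}'\mathfrak{S}'')}{(\mathfrak{S}'\mathfrak{S}''\mathfrak{S}''')}. \qquad (20)$$

From the corrected values of $q_1$, $q_2$, $q_3$ we may calculate a new residual $\mathfrak{S}$, 
and from that determine another correction for each of the quantities 
$q_1$, $q_2$, $q_3$. 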
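
The three corrections are the solution of one vector equation in three unknowns, obtained by ratios of parallelepiped volumes. A minimal Python sketch of this single step follows; it is an illustration only, the function name `linear_corrections` and the sample vectors are hypothetical, and the residual and its three derivative vectors are supposed already computed.

```python
def direct(a, b):
    """Direct (scalar) product."""
    return a[0] * b[0] + a[1] * b[1] + a[2] * b[2]


def skew(a, b):
    """Skew (vector) product."""
    return [a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0]]


def triple(a, b, c):
    """Volume (ABC) = A.(BxC)."""
    return direct(a, skew(b, c))


def linear_corrections(S, S1, S2, S3):
    """Solve -S = S1*dq1 + S2*dq2 + S3*dq3 by ratios of volumes.

    S is the residual; S1, S2, S3 stand for its derivatives with respect
    to q1, q2, q3 (written with one, two, and three accents in the text).
    """
    volume = triple(S1, S2, S3)
    if volume == 0.0:
        raise ZeroDivisionError("the three derivative vectors are coplanar")
    dq1 = -triple(S, S2, S3) / volume
    dq2 = -triple(S, S3, S1) / volume
    dq3 = -triple(S, S1, S2) / volume
    return dq1, dq2, dq3


# Hypothetical derivative vectors, and a residual built from known corrections.
S1, S2, S3 = [1.0, 0.0, 0.0], [1.0, 1.0, 0.0], [0.0, 1.0, 2.0]
known = (0.5, -0.25, 0.1)
S = [-(known[0] * a + known[1] * b + known[2] * c) for a, b, c in zip(S1, S2, S3)]

found = linear_corrections(S, S1, S2, S3)
```

The tuple `found` reproduces the corrections from which the residual was constructed. In the actual problem the step would be repeated with a freshly computed residual, since the relation is only approximate for finite differences.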

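It will sometimes be worth while to use formulae a little less simple 
for the sake of a more rapid approximation. Instead of equation (19) 
we may write, with a higher degree of accuracy, 

$$-\mathfrak{S} = \mathfrak{S}'\Delta q_1 + \mathfrak{S}''\Delta q_2 + \mathfrak{S}'''\Delta q_3 + \tfrac{1}{2}\mathfrak{T}'(\Delta q_1)^2 + \tfrac{1}{2}\mathfrak{T}''(\Delta q_2)^2 + \tfrac{1}{2}\mathfrak{T}'''(\Delta q_3)^2, \qquad (21)$$

where 

$$\begin{aligned}
\mathfrak{T}' &= \frac{d^2\mathfrak{S}_1}{dq_1^2} = 2A_1B_1\,\frac{d(r_1^{-3})}{dq_1}\,\mathfrak{F}_1 + A_1B_1\,\frac{d^2(r_1^{-3})}{dq_1^2}\,\mathfrak{R}_1,\\
\mathfrak{T}'' &= \frac{d^2\mathfrak{S}_2}{dq_2^2} = 2B_2\,\frac{d(r_2^{-3})}{dq_2}\,\mathfrak{F}_2 + B_2\,\frac{d^2(r_2^{-3})}{dq_2^2}\,\mathfrak{R}_2,\\
\mathfrak{T}''' &= \frac{d^2\mathfrak{S}_3}{dq_3^2} = 2A_3B_3\,\frac{d(r_3^{-3})}{dq_3}\,\mathfrak{F}_3 + A_3B_3\,\frac{d^2(r_3^{-3})}{dq_3^2}\,\mathfrak{R}_3.
\end{aligned} \qquad (22)$$

It is evident that $\mathfrak{T}''$ is generally many times greater than $\mathfrak{T}'$ or $\mathfrak{T}'''$, 
the factor $B_2$, in the case of equal intervals, being exactly ten times as 
great as $A_1B_1$ or $A_3B_3$. This shows, in the first place, that the accurate 
determination of $\Delta q_2$ is of the most importance for the subsequent 
approximations. It also shows that we may attain nearly the same 
accuracy in writing 

$$-\mathfrak{S} = \mathfrak{S}'\Delta q_1 + \mathfrak{S}''\Delta q_2 + \mathfrak{S}'''\Delta q_3 + \tfrac{1}{2}\mathfrak{T}''\Delta q_2^{\,2}. \qquad (23)$$

We may, however, often do a little better than this without using 
a more complicated equation. For $\mathfrak{T}' + \mathfrak{T}'''$ may be estimated very 
roughly as equal to $\tfrac{1}{5}\mathfrak{T}''$. Whenever, therefore, $\Delta q_1$ and $\Delta q_3$ are about 
as large as $\Delta q_2$, as is often the case, it may be a little better to use the 
coefficient $\tfrac{6}{10}$ instead of $\tfrac{1}{2}$ in the last term. 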

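For $\Delta q_2$, then, we have the equation 

$$-(\mathfrak{S}\mathfrak{S}'''\mathfrak{S}') = (\mathfrak{S}'\mathfrak{S}''\mathfrak{S}''')\,\Delta q_2 + \tfrac{6}{10}(\mathfrak{T}''\mathfrak{S}'''\mathfrak{S}')\,\Delta q_2^{\,2}. \qquad (24)$$

$(\mathfrak{T}''\mathfrak{S}'''\mathfrak{S}')$ is easily computed from the formula 

$$(\mathfrak{T}''\mathfrak{S}'''\mathfrak{S}') = \left(\frac{1}{q_2}-\frac{5q_2}{r_2^2}\right)(\mathfrak{S}'\mathfrak{S}''\mathfrak{S}''') + \left[\left(\frac{1}{q_2}-\frac{5q_2}{r_2^2}\right)\left(1-\frac{B_2}{r_2^3}\right) - \frac{6B_2q_2}{r_2^5}\right](\mathfrak{F}_2\mathfrak{S}'''\mathfrak{S}'), \qquad (25)$$

which may be derived from equations (18) and (22). 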


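* These equations are obtained by taking the direct products of both members of the 
preceding equation with $\mathfrak{S}''\times\mathfrak{S}'''$, $\mathfrak{S}'''\times\mathfrak{S}'$, and $\mathfrak{S}'\times\mathfrak{S}''$, respectively. See footnote 
on page 119. 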





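The quadratic equation (24) gives two values of the correction to be 
applied to the position of the body. When they are not too large, 
they will belong to two different solutions of the problem, generally 
to the two least removed from the values assumed. But a very large 
value of $\Delta q_2$ must not be regarded as affording any trustworthy 
indication of a solution of the problem. In the majority of cases we 
only care for one of the roots of the equation, which is distinguished 
by being very small, and which will be most easily calculated by a 
small correction to the value which we get by neglecting the quadratic 
term.* 

When a comet is somewhat near the earth we may make use of the 
fact that the earth’s orbit is one solution of the problem, i.e., that $-\rho_2$ 
is one value of $\Delta q_2$, to save the trifling labor of computing the value 
of $(\mathfrak{T}''\mathfrak{S}'''\mathfrak{S}')$. For it is evident from the theory of equations that 
if $-\rho_2$ and $z$ are the two roots, 

$$z - \rho_2 = -\frac{(\mathfrak{S}'\mathfrak{S}''\mathfrak{S}''')}{\tfrac{6}{10}(\mathfrak{T}''\mathfrak{S}'''\mathfrak{S}')}, \qquad -\rho_2 z = \frac{(\mathfrak{S}\mathfrak{S}'''\mathfrak{S}')}{\tfrac{6}{10}(\mathfrak{T}''\mathfrak{S}'''\mathfrak{S}')}.$$

Eliminating $(\mathfrak{T}''\mathfrak{S}'''\mathfrak{S}')$ we have 

$$\frac{\rho_2 z}{\rho_2 - z} = -\frac{(\mathfrak{S}\mathfrak{S}'''\mathfrak{S}')}{(\mathfrak{S}'\mathfrak{S}''\mathfrak{S}''')},$$

whence 

$$\frac{1}{z} = \frac{1}{\rho_2} - \frac{(\mathfrak{S}'\mathfrak{S}''\mathfrak{S}''')}{(\mathfrak{S}\mathfrak{S}'''\mathfrak{S}')}.$$

Now $-\dfrac{(\mathfrak{S}\mathfrak{S}'''\mathfrak{S}')}{(\mathfrak{S}'\mathfrak{S}''\mathfrak{S}''')}$ is the value of $\Delta q_2$, which we obtain if we neglect 
the quadratic term in equation (24). If we call this value $[\Delta q_2]$, we 
have for the more exact value† 

$$\Delta q_2 = \frac{[\Delta q_2]}{1 + \dfrac{[\Delta q_2]}{\rho_2}}. \qquad (26)$$

The quantities $\Delta q_1$ and $\Delta q_3$ might be calculated by the equations 

$$\begin{aligned}
-(\mathfrak{S}\mathfrak{S}''\mathfrak{S}''') &= (\mathfrak{S}'\mathfrak{S}''\mathfrak{S}''')\,\Delta q_1 + \tfrac{6}{10}(\mathfrak{T}''\mathfrak{S}''\mathfrak{S}''')\,\Delta q_2^{\,2},\\
-(\mathfrak{S}\mathfrak{S}'\mathfrak{S}'') &= (\mathfrak{S}'\mathfrak{S}''\mathfrak{S}''')\,\Delta q_3 + \tfrac{6}{10}(\mathfrak{T}''\mathfrak{S}'\mathfrak{S}'')\,\Delta q_2^{\,2}.
\end{aligned} \qquad (27)$$

But a little examination will show that the coefficients of $\Delta q_2^{\,2}$ in 
these equations will not generally have very different values from the 
coefficient of the same quantity in equation (24). We may therefore 
write with sufficient accuracy 

$$\Delta q_1 = [\Delta q_1] + \Delta q_2 - [\Delta q_2], \qquad \Delta q_3 = [\Delta q_3] + \Delta q_2 - [\Delta q_2], \qquad (28)$$

where $[\Delta q_1]$, $[\Delta q_2]$, $[\Delta q_3]$ denote the values obtained from equations (20). 

* In the case of Swift’s comet (V, 1880), the writer found by the quadratic equation 
$-.247$ and $-.116$ for corrections of the assumed geocentric distance $.250$. The first of 
these numbers gives an approximation to the position of the earth; the second to that 
of the comet, viz., the geocentric distance $.134$ instead of the true value $.1333$. The 
coefficient $\tfrac{6}{10}$ was used in the quadratic equation; with the coefficient $\tfrac{1}{2}$ the approximations 
would not be quite so good. The value of the correction obtained by neglecting 
the quadratic term was $-.079$, which indicates that the approximations (in this very 
critical case) would be quite tedious without the use of the quadratic term. 

† In the case mentioned in the preceding footnote, from $[\Delta q_2] = -.079$ and $\rho_2 = .25$, 
we get $\Delta q_2 = -.1155$, which is sensibly the same value as that obtained by calculating 
the quadratic term. 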
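
The device of using the earth's position as a known root of the quadratic can be illustrated with a few lines of arithmetic. The sketch below is an assumption-laden illustration, not part of the memoir: `refine_with_earth_root` applies equation (26), and `linear_estimate` forms the value obtained by neglecting the quadratic term for a quadratic whose two roots are $-\rho_2$ and $z$. Both names are hypothetical, and the numbers are those quoted in the footnotes for Swift's comet.

```python
def refine_with_earth_root(dq2_linear, rho2):
    """More exact correction from the linear estimate, the other root being -rho2."""
    return dq2_linear / (1.0 + dq2_linear / rho2)


def linear_estimate(z, rho2):
    """Estimate -a/b for the quadratic (dq + rho2)(dq - z) = 0.

    Expanded, the quadratic is dq^2 + (rho2 - z) dq - rho2*z = 0, so the
    constant term is a = -rho2*z and the linear coefficient is b = rho2 - z.
    """
    a = -rho2 * z
    b = rho2 - z
    return -a / b


rho2 = 0.25
z_true = -0.116                       # root belonging to the comet
estimate = linear_estimate(z_true, rho2)
recovered = refine_with_earth_root(estimate, rho2)

# The same formula applied to the rounded figure -.079 of the footnote.
from_footnote = refine_with_earth_root(-0.079, rho2)
```

Here `estimate` is close to $-.079$, `recovered` returns the root $-.116$ exactly (apart from rounding), and `from_footnote` is close to $-.1155$.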

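In making successive corrections of the distances $q_1$, $q_2$, $q_3$, it will 
not be necessary to recalculate the values of $\mathfrak{S}'$, $\mathfrak{S}''$, $\mathfrak{S}'''$, when these 
have been calculated from fairly good values of $q_1$, $q_2$, $q_3$. But when, 
as is generally the case, the first assumption is only a rude guess, the 
values of $\mathfrak{S}'$, $\mathfrak{S}''$, $\mathfrak{S}'''$ should be recalculated after one or two corrections 
of $q_1$, $q_2$, $q_3$. To get the best results when we do not recalculate $\mathfrak{S}'$, 
$\mathfrak{S}''$, $\mathfrak{S}'''$, we may proceed as follows: Let $\mathfrak{S}'$, $\mathfrak{S}''$, $\mathfrak{S}'''$ denote the values 
which have been calculated; $Dq_1$, $Dq_2$, $Dq_3$, respectively, the sum of 
the corrections of each of the quantities $q_1$, $q_2$, $q_3$, which have been 
made since the calculation of $\mathfrak{S}'$, $\mathfrak{S}''$, $\mathfrak{S}'''$; $\mathfrak{S}$ the residual after all the 
corrections of $q_1$, $q_2$, $q_3$, which have been made; and $\Delta q_1$, $\Delta q_2$, $\Delta q_3$ 
the remaining corrections which we are seeking. We have, then, 
very nearly 

$$\begin{aligned}
-\mathfrak{S} = {} & \{\mathfrak{S}' + \mathfrak{T}'(Dq_1 + \tfrac{1}{2}\Delta q_1)\}\Delta q_1 + \{\mathfrak{S}'' + \mathfrak{T}''(Dq_2 + \tfrac{1}{2}\Delta q_2)\}\Delta q_2\\
& + \{\mathfrak{S}''' + \mathfrak{T}'''(Dq_3 + \tfrac{1}{2}\Delta q_3)\}\Delta q_3.
\end{aligned} \qquad (29)$$

The same considerations which we applied to equation (21) enable 
us to simplify this equation also, and to write with a fair degree of 
accuracy 

$$-(\mathfrak{S}\mathfrak{S}'''\mathfrak{S}') = \{(\mathfrak{S}'\mathfrak{S}''\mathfrak{S}''') + \tfrac{6}{5}(\mathfrak{T}''\mathfrak{S}'''\mathfrak{S}')(Dq_2 + \tfrac{1}{2}\Delta q_2)\}\Delta q_2, \qquad (30)$$

$$\Delta q_1 = [\Delta q_1] + \Delta q_2 - [\Delta q_2], \qquad \Delta q_3 = [\Delta q_3] + \Delta q_2 - [\Delta q_2], \qquad (31)$$

where 

$$[\Delta q_1] = -\frac{(\mathfrak{S}\mathfrak{S}''\mathfrak{S}''')}{(\mathfrak{S}'\mathfrak{S}''\mathfrak{S}''')}, \qquad [\Delta q_2] = -\frac{(\mathfrak{S}\mathfrak{S}'''\mathfrak{S}')}{(\mathfrak{S}'\mathfrak{S}''\mathfrak{S}''')}, \qquad [\Delta q_3] = -\frac{(\mathfrak{S}\mathfrak{S}'\mathfrak{S}'')}{(\mathfrak{S}'\mathfrak{S}''\mathfrak{S}''')}. \qquad (32)$$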


Correction of the Fundamental Equation. 

When we have thus determined, by the numerical solution of our 
fundamental equation, approximate values of the three positions of 
the body, it will always be possible to apply a small numerical 
correction to the equation, so as to make it agree exactly with the 
laws of elliptic motion in a fictitious case differing but little from 
the actual. After such a correction the equation will evidently apply 
to the actual case with a much higher degree of approximation. 

There is room for great diversity in the application of this principle. 
The method which appears to the writer the most simple and direct is 
the following, in which the correction of the intervals for aberration 
is combined with the correction required by the approximate nature of 
the equation.* 

The solution of the fundamental equation gives us three points, 
which must necessarily lie in one plane with the sun, and in the lines 
of sight of the several observations. Through these points we may 
pass an ellipse, and calculate the intervals of time required by the 
exact laws of elliptic motion for the passage of the body between 
them. If these calculated intervals should be identical with the given 
intervals, corrected for aberration, we would evidently have the true 
solution of the problem. But suppose, to fix our ideas, that the 
calculated intervals are a little too long. It is evident that if we 
repeat our calculations, using in our fundamental equation intervals 
shortened in the same ratio as the calculated intervals have come out 
too long, the intervals calculated from the second solution of the 
fundamental equation must agree almost exactly with the desired 
values. If necessary, this process may be repeated, and thus any 
required degree of accuracy may be obtained, whenever the solution of 
the uncorrected equation gives an approximation to the true positions. 
For this it is necessary that the intervals should not be too great. It 
appears, however, from the results of the example of Ceres, given 
hereafter, in which the heliocentric motion exceeds 62° but the 
calculated values of the intervals of time differ from the given values 
by little more than one part in two thousand, that we have here not 
approached the limit of the application of our formula. 

In the usual terminology of the subject, the fundamental equation 
with intervals uncorrected for aberration represents the first hypothesis; 
the same equation with the intervals affected by certain numerical 
coefficients (differing little from unity) represents the second hypothesis; 
the third hypothesis, should such be necessary, is represented by a 
similar equation with corrected coefficients, etc.

In the process indicated there are certain economies of labor which 
should not be left unmentioned, and certain precautions to be observed 
in order that the neglected figures in our computations may not 
unduly influence the result. 

* When an approximate orbit is known in advance, we may correct the fundamental
equation at once. The formulae will be given in the Summary, §XII.

It is evident, in the first place, that for the correction of our
fundamental equation we need not trouble ourselves with the position of the
orbit in the solar system. The intervals of time, which determine this
correction, depend only on the three heliocentric distances $r_1$, $r_2$, $r_3$
and the two heliocentric angles, which will be represented by $v_2 - v_1$ and
$v_3 - v_2$, if we write $v_1$, $v_2$, $v_3$ for the true anomalies. These angles
($v_2 - v_1$, $v_3 - v_2$) may be determined from $r_1$, $r_2$, $r_3$ and $n_1$, $n_2$,
$n_3$, and therefore from $r_1$, $r_2$, $r_3$ and the given intervals. For our
fundamental equation, which may be written

$$n_1 \mathfrak{R}_1 - n_2 \mathfrak{R}_2 + n_3 \mathfrak{R}_3 = 0, \tag{33}$$

indicates that we may form a triangle in which the lengths of the
sides shall be $n_1 r_1$, $n_2 r_2$, and $n_3 r_3$ (let us say for brevity, $s_1$, $s_2$, $s_3$), and
the directions of the sides parallel with the three heliocentric directions
of the body. The angles opposite $s_1$ and $s_3$ will be respectively $v_3 - v_2$
and $v_2 - v_1$. We have, therefore, by a well-known formula,

$$\tan\tfrac12(v_3 - v_2) = \sqrt{\frac{(s_1 - s_2 + s_3)(s_1 + s_2 - s_3)}{(s_1 + s_2 + s_3)(-s_1 + s_2 + s_3)}}, \qquad
\tan\tfrac12(v_2 - v_1) = \sqrt{\frac{(s_1 - s_2 + s_3)(-s_1 + s_2 + s_3)}{(s_1 + s_2 + s_3)(s_1 + s_2 - s_3)}}. \tag{34}$$


As soon, therefore, as the solution of our fundamental equation
has given a sufficient approximation to the values of $r_1$, $r_2$, $r_3$ (say
five- or six-figure values, if our final result is to be as exact as
seven-figure logarithms can make it), we calculate $n_1$, $n_2$, $n_3$ with
seven-figure logarithms by equations (2), and the heliocentric angles
by equations (34).
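
As a check on equations (34), here is a minimal sketch in Python. It uses the equivalent half-angle form given later in Summary VII, and takes for $s_1$, $s_2$, $s_3$ the values read from the worked Ceres example (first hypothesis). The function name is an assumption made for illustration.

```python
import math

def half_angles(s1, s2, s3):
    """Half heliocentric angles from the sides of the triangle of eq. (33)."""
    s = 0.5 * (s1 + s2 + s3)
    R = math.sqrt((s - s1) * (s - s2) * (s - s3) / s)
    half_21 = math.atan2(R, s - s3)   # (v2 - v1)/2, the angle opposite s3
    half_32 = math.atan2(R, s - s1)   # (v3 - v2)/2, the angle opposite s1
    half_31 = math.atan2(s - s2, R)   # (v3 - v1)/2
    return half_21, half_32, half_31

# Sides read from the table under VII of the numerical example.
h21, h32, h31 = half_angles(1.3308476, 2.2796616, 1.3417404)
motion_deg = math.degrees(2 * h31)    # whole heliocentric motion
```

The two partial half-angles add up to the third, and the whole motion comes out a little under 63 degrees, in agreement with the statement that the heliocentric motion of the example exceeds 62 degrees.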

The semi-parameter corresponding to these values of the heliocentric
distances and angles is given by the equation

$$p = \frac{n_1 r_1 - n_2 r_2 + n_3 r_3}{n_1 - n_2 + n_3}. \tag{35}$$

The expression $n_1 - n_2 + n_3$, which occurs in the value of the
semi-parameter, and the expression $n_1 r_1 - n_2 r_2 + n_3 r_3$, or $s_1 - s_2 + s_3$, which
occurs both in the value of the semi-parameter and in the formulae for
determining the heliocentric angles, represent small quantities of the
second order (if we call the heliocentric angles small quantities of the
first order), and cannot be very accurately determined from approximate
numerical values of their separate terms. The first of these
quantities may, however, be determined accurately by the formula

$$n_1 - n_2 + n_3 = \frac{A_1 B_1}{r_1^3} + \frac{B_2}{r_2^3} + \frac{A_3 B_3}{r_3^3}. \tag{36}$$

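
Equations (35) and (36) can be tried on the numbers of the worked example. The sketch below assumes the definitions $n_1 = A_1(1 + B_1/r_1^3)$, $n_2 = 1 - B_2/r_2^3$, $n_3 = A_3(1 + B_3/r_3^3)$, which are consistent with the expressions for $s_1$, $s_2$, $s_3$ in Summary VII; the inputs are the logarithms printed in the example.

```python
import math

A1 = 0.4847187
A3 = 1.0 - A1                                    # the relation A1 + A3 = 1
B1, B2, B3 = 10 ** (9.6692113 - 10), 10 ** 0.3183722, 10 ** (9.5623916 - 10)
r1, r2, r3 = 10 ** 0.4282377, 10 ** 0.4132937, 10 ** 0.4061399

n1 = A1 * (1 + B1 / r1 ** 3)
n2 = 1 - B2 / r2 ** 3
n3 = A3 * (1 + B3 / r3 ** 3)

N_direct = n1 - n2 + n3                          # difference of nearly equal terms
N_stable = A1 * B1 / r1 ** 3 + B2 / r2 ** 3 + A3 * B3 / r3 ** 3   # eq. (36)

s1, s2, s3 = n1 * r1, n2 * r2, n3 * r3
p = (s1 - s2 + s3) / N_stable                    # eq. (35)
```

In floating point the two forms of the denominator agree; with seven-figure logarithms only the second avoids the loss of leading figures.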


With respect to the quantity $s_1 - s_2 + s_3$, a little consideration will show
that if we are careful to use the same value wherever the expression
occurs, both in the formulae for the heliocentric angles and for the
semi-parameter, the inaccuracy of the determination of this value from
the cause mentioned will be of no consequence in the process of
correcting the fundamental equation. For although the logarithm
of $s_1 - s_2 + s_3$ as calculated by seven-figure logarithms from $r_1$, $r_2$, $r_3$,
may be accurate only to four or five figures, we may regard it as
absolutely correct if we make a very small change in the value of one
of the heliocentric distances (say $r_2$). We need not trouble ourselves
farther about this change, for it will be of a magnitude which we
neglect in computations with seven-figure tables. That the heliocentric
angles thus determined may not agree as closely as they might
with the positions on the lines of sight determined by the first solution
of the fundamental equation is of no especial consequence in the
correction of the fundamental equation, which only requires the exact
fulfilment of two conditions, viz., that our values of the heliocentric
distances and angles shall have the relations required by the fundamental
equation to the given intervals of time, and that they shall
have the relations required by the exact laws of elliptic motion to
the calculated intervals of time. The third condition, that none of
these values shall differ too widely from the actual values, is of a
looser character.

After the determination of the heliocentric angles and the semi¬ 
parameter, the eccentricity and the true anomalies of the three 
positions may next be determined, and from these the intervals of 
time. These processes require no especial notice. The appropriate 
formulae will be given in the Summary of Formulae. 

Determination of the Orbit from the Three Positions and the
Intervals of Time. 

The values of the semi-parameter and the heliocentric angles as
given in the preceding paragraphs depend upon the quantity $s_1 - s_2 + s_3$,
the numerical determination of which from $s_1$, $s_2$, and $s_3$ is critical to
the second degree when the heliocentric angles are small. This was
of no consequence in the process which we have called the correction
of the fundamental equation. But for the actual determination of the
orbit from the positions given by the corrected equation (or by the
uncorrected equation, when we judge that to be sufficient) a more
accurate determination of this quantity will generally be necessary.
This may be obtained in different ways, of which the following is
perhaps the most simple. Let us set

$$\mathfrak{S}_4 = \mathfrak{S}_3 - \mathfrak{S}_1, \tag{37}$$

and $s_4$ for the length of the vector $\mathfrak{S}_4$, obtained by taking the square
root of the sum of the squares of the components of the vector. It is
evident that $s_2$ is the longer and $s_4$ the shorter diagonal of a parallelogram
of which the sides are $s_1$ and $s_3$. The area of the triangle having
the sides $s_1$, $s_2$, $s_3$ is therefore equal to that of the triangle having the
sides $s_1$, $s_3$, $s_4$, each being one-half of the parallelogram. This gives

$$(s_1 + s_2 + s_3)(-s_1 + s_2 + s_3)(s_1 - s_2 + s_3)(s_1 + s_2 - s_3)
= (s_1 + s_4 + s_3)(-s_1 + s_4 + s_3)(s_1 - s_4 + s_3)(s_1 + s_4 - s_3), \tag{38}$$
and

$$s_1 - s_2 + s_3 = \frac{(s_1 + s_4 + s_3)(-s_1 + s_4 + s_3)(s_1 - s_4 + s_3)(s_1 + s_4 - s_3)}{(s_1 + s_2 + s_3)(-s_1 + s_2 + s_3)(s_1 + s_2 - s_3)}. \tag{39}$$

The numerical determination of this value of $s_1 - s_2 + s_3$ is critical
only to the first degree.
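
A small sketch of equation (39) follows. The two vectors are invented test data (they are not from the example), chosen so that the angle between them is moderate; the sketch only confirms that the value of $s_1 - s_2 + s_3$ obtained through the shorter diagonal $s_4$ is the same as the direct difference.

```python
import math

def norm(v):
    return math.sqrt(sum(c * c for c in v))

def small_difference(S1, S3):
    """s1 - s2 + s3 by eq. (39) and directly, taking S2 = -(S1 + S3)."""
    S4 = [c3 - c1 for c1, c3 in zip(S1, S3)]          # eq. (37)
    s1, s3, s4 = norm(S1), norm(S3), norm(S4)
    s2 = norm([c1 + c3 for c1, c3 in zip(S1, S3)])    # longer diagonal
    num = (s1 + s4 + s3) * (-s1 + s4 + s3) * (s1 - s4 + s3) * (s1 + s4 - s3)
    den = (s1 + s2 + s3) * (-s1 + s2 + s3) * (s1 + s2 - s3)
    return num / den, s1 - s2 + s3

# Hypothetical vectors, for illustration only.
by_eq39, direct = small_difference([1.3, 0.1, 0.0], [1.2, 0.6, 0.05])
```

With limited-precision tables the first value is the better determined, because its small factors come from the vector $\mathfrak{S}_4$ itself and not from the difference of large numbers.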

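The eccentricity and the true anomalies may be determined in the
same way as in the correction of the formula. The position of
the orbit in space may be derived from the following considerations.
The vector $-\mathfrak{S}_2$ is directed from the sun toward the second position
of the body; the vector $\mathfrak{S}_4$ from the first to the third position.
If we set

$$\mathfrak{S}_5 = \mathfrak{S}_4 - \frac{\mathfrak{S}_4 \cdot \mathfrak{S}_2}{s_2^2}\,\mathfrak{S}_2, \tag{40}$$

the vector $\mathfrak{S}_5$ will be in the plane of the orbit, perpendicular to $-\mathfrak{S}_2$
and on the side toward which anomalies increase. If we write $s_5$ for
the length of $\mathfrak{S}_5$,

$$-\frac{\mathfrak{S}_2}{s_2} \quad\text{and}\quad \frac{\mathfrak{S}_5}{s_5}$$

will be unit vectors. Let $\mathfrak{I}$ and $\mathfrak{I}'$ be unit vectors determining the
position of the orbit, $\mathfrak{I}$ being drawn from the sun toward the perihelion,
and $\mathfrak{I}'$ at right angles to $\mathfrak{I}$, in the plane of the orbit, and on
the side toward which anomalies increase. Then

$$\mathfrak{I} = -\cos v_2\,\frac{\mathfrak{S}_2}{s_2} - \sin v_2\,\frac{\mathfrak{S}_5}{s_5}, \tag{41}$$

$$\mathfrak{I}' = -\sin v_2\,\frac{\mathfrak{S}_2}{s_2} + \cos v_2\,\frac{\mathfrak{S}_5}{s_5}. \tag{42}$$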
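
Equations (40) to (42) can be verified on a synthetic orbit. In the sketch below every number (the axes, the eccentricity, the three true anomalies) is invented for the test; the vectors $\mathfrak{S}_1$, $\mathfrak{S}_2$, $\mathfrak{S}_3$ are built so that their sum vanishes, and the function then has to return the axes that were put in.

```python
import math

def dot(a, b):
    return sum(x * y for x, y in zip(a, b))

def combine(ka, a, kb, b):
    """Return ka*a + kb*b for 3-vectors."""
    return [ka * x + kb * y for x, y in zip(a, b)]

def orbit_axes(S1, S2, S3, v2):
    S4 = combine(1.0, S3, -1.0, S1)                          # eq. (37)
    S5 = combine(1.0, S4, -dot(S4, S2) / dot(S2, S2), S2)    # eq. (40)
    s2, s5 = math.sqrt(dot(S2, S2)), math.sqrt(dot(S5, S5))
    I = combine(-math.cos(v2) / s2, S2, -math.sin(v2) / s5, S5)   # eq. (41)
    Ip = combine(-math.sin(v2) / s2, S2, math.cos(v2) / s5, S5)   # eq. (42)
    return I, Ip

# Synthetic orbit (assumed values): unit axes, e, p, and three true anomalies.
I_true = [math.cos(0.3), math.sin(0.3) * math.cos(0.2), math.sin(0.3) * math.sin(0.2)]
Ip_true = [-math.sin(0.3), math.cos(0.3) * math.cos(0.2), math.cos(0.3) * math.sin(0.2)]
e, p, v = 0.1, 1.0, [-0.5, -0.2, 0.2]
r = [p / (1 + e * math.cos(vi)) for vi in v]
pos = [combine(ri * math.cos(vi), I_true, ri * math.sin(vi), Ip_true)
       for ri, vi in zip(r, v)]

def area(i, j):
    return r[i] * r[j] * math.sin(v[j] - v[i])   # twice the triangle area

n1, n3 = area(1, 2) / area(0, 2), area(0, 1) / area(0, 2)   # with n2 = 1
S1 = [n1 * c for c in pos[0]]
S2 = [-c for c in pos[1]]
S3 = [n3 * c for c in pos[2]]
I_rec, Ip_rec = orbit_axes(S1, S2, S3, v[1])
```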

The time of perihelion passage ($T$) may be determined from any
one of the observations by the equation

$$\frac{k}{a^{3/2}}\,(t - T) = E - e \sin E, \tag{43}$$

the eccentric anomaly $E$ being calculated from the true anomaly $v$.
The interval $t - T$ in this equation is to be measured in days. A
better value of T may be found by averaging the three values given 
by the separate observations, with such weights as the circumstances 
may suggest. But any considerable differences in the three values 
of T would indicate the necessity of a second correction of the 
formula, and furnish the basis for it. 
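
Equation (43) may be sketched as follows. The conversion from true to eccentric anomaly uses the relation $\tan\tfrac12 E = \epsilon \tan\tfrac12 v$ with $\epsilon = \sqrt{(1-e)/(1+e)}$ from the Summary; the function names are assumptions, and the check values are the first-hypothesis figures of the worked example.

```python
import math

K_GAUSS = 10 ** (8.2355814 - 10)      # log k = 8.2355814 (after Gauss)

def eccentric_anomaly(v, e):
    eps = math.sqrt((1 - e) / (1 + e))
    return 2 * math.atan(eps * math.tan(v / 2))

def perihelion_time(t, v, e, a):
    """Solve eq. (43) for T; t in days, a in astronomical units."""
    E = eccentric_anomaly(v, e)
    return t - a ** 1.5 * (E - e * math.sin(E)) / K_GAUSS

# First observation of the example (first hypothesis): v1 and log e from table VII.
e_ex = 10 ** (8.9025438 - 10)
v1 = -math.radians(71 + 23 / 60 + 19.51 / 3600)
E1_deg = math.degrees(eccentric_anomaly(v1, e_ex))
```

A body observed before perihelion (negative anomaly) gives a value of $T$ later than $t$, as it should.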

For the calculation of an ephemeris we have

$$\mathfrak{R} = -ea\,\mathfrak{I} + \cos E\; a\,\mathfrak{I} + \sin E\; b\,\mathfrak{I}' \tag{44}$$

in connection with the preceding equation.

Sometimes it may be worth while to make the calculations for the 
correction of the formula in the slightly longer form indicated for 
the determination of the orbit. This will be the case when we wish
simultaneously to correct the formula for its theoretical imperfection, 
and to correct the observations by comparison with others not too 
remote. The rough approximation to the orbit given by the
uncorrected formula may be sufficient for this purpose. In fact, for
observations separated by very small intervals, the imperfection of 
the uncorrected formula will be likely to affect the orbit less than 
the errors of the observations. 

The computer may prefer to determine the orbit from the first and
third heliocentric positions with their times. This process, which has 
certain advantages, is perhaps a little longer than that here given, and 
does not lend itself quite so readily to successive improvements of the 
hypothesis. When it is desired to derive an improved hypothesis 
from an orbit thus determined, the formulae in §XII of the summary
may be used. 


SUMMARY OF FORMULAE


WITH DIRECTIONS FOR USE.

(For the case in which an approximate orbit is known in advance, see XII.)


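I.

Preliminary computations relating to the intervals of time.

$t_1$, $t_2$, $t_3$ = times of the observations in days.

log k = 8.2355814 (after Gauss)

$$\tau_1 = k(t_3 - t_2), \qquad \tau_3 = k(t_2 - t_1),$$

$$A_1 = \frac{\tau_1}{\tau_1 + \tau_3}, \qquad A_3 = \frac{\tau_3}{\tau_1 + \tau_3},$$

$$B_1 = \frac{-\tau_1^2 + \tau_1\tau_3 + \tau_3^2}{12}, \qquad
B_2 = \frac{\tau_1^2 + 3\tau_1\tau_3 + \tau_3^2}{12}, \qquad
B_3 = \frac{\tau_1^2 + \tau_1\tau_3 - \tau_3^2}{12}.$$

For control: $A_1 B_1 + B_2 + A_3 B_3 = \tfrac12 \tau_1 \tau_3$.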
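
The computations of §I can be sketched in a few lines and compared with the values printed under I of the numerical example. The first time is taken as 5.51336, which is an inference from the printed difference $t_2 - t_1 = 133.91375$; the function name and the dictionary layout are choices made here for illustration.

```python
import math

LOG_K = 8.2355814 - 10            # log k, after Gauss

def interval_constants(t1, t2, t3):
    k = 10 ** LOG_K
    tau1, tau3 = k * (t3 - t2), k * (t2 - t1)
    A1, A3 = tau1 / (tau1 + tau3), tau3 / (tau1 + tau3)
    B1 = (-tau1 ** 2 + tau1 * tau3 + tau3 ** 2) / 12
    B2 = (tau1 ** 2 + 3 * tau1 * tau3 + tau3 ** 2) / 12
    B3 = (tau1 ** 2 + tau1 * tau3 - tau3 ** 2) / 12
    return {"tau1": tau1, "tau3": tau3, "A1": A1, "A3": A3,
            "B1": B1, "B2": B2, "B3": B3}

# Times of the Ceres example (days of September, 1805).
ceres = interval_constants(5.51336, 139.42711, 265.39813)
control = ceres["A1"] * ceres["B1"] + ceres["B2"] + ceres["A3"] * ceres["B3"]
half_product = 0.5 * ceres["tau1"] * ceres["tau3"]
```

The control relation is an algebraic identity, so in exact arithmetic the two sides agree; the small difference in the last figure printed in the example reflects only the rounding of seven-figure logarithms.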


II.

Preliminary computations relating to the first observation.

$X_1$, $Y_1$, $Z_1$ (components of $\mathfrak{E}_1$) = the heliocentric coordinates of the
        earth, increased by the geocentric coordinates of the observatory.

$\xi_1$, $\eta_1$, $\zeta_1$ (components of $\mathfrak{F}_1$) = the direction-cosines of the observed
        position, corrected for the aberration of the fixed stars.

Preliminary computations relating to the second and third
observations.

The formulae are entirely analogous to those relating to the first
observation, the quantities being distinguished by the proper suffixes.
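
The sketch below works out the quantities of §II for the first Ceres observation. The defining formulae for $(\mathfrak{E}_1\cdot\mathfrak{F}_1)$ and $p_1^2$ do not survive in this copy of §II; the code assumes $(\mathfrak{E}_1\cdot\mathfrak{F}_1) = X_1\xi_1 + Y_1\eta_1 + Z_1\zeta_1$ and $p_1^2 = X_1^2 + Y_1^2 + Z_1^2 - (\mathfrak{E}_1\cdot\mathfrak{F}_1)^2$, which reproduce the numbers printed under II of the example. The seconds of the first latitude and of the first longitude of the Earth are partly illegible in the data table; the values used are assumptions consistent with the printed logarithms.

```python
import math

def dms(deg, minutes, seconds):
    return math.radians(deg + minutes / 60 + seconds / 3600)

def observation_constants(lon_body, lat_body, lon_earth, log_dist):
    R = 10 ** log_dist
    # Earth in the ecliptic; observatory offset neglected (positions freed
    # from parallax in the example).
    X, Y, Z = R * math.cos(lon_earth), R * math.sin(lon_earth), 0.0
    xi = math.cos(lat_body) * math.cos(lon_body)
    eta = math.cos(lat_body) * math.sin(lon_body)
    zeta = math.sin(lat_body)
    EF = X * xi + Y * eta + Z * zeta          # assumed definition of (E.F)
    p2 = X * X + Y * Y + Z * Z - EF * EF      # assumed definition of p^2
    return (X, Y, Z), (xi, eta, zeta), EF, p2

earth1, dir1, EF1, p2_1 = observation_constants(
    dms(95, 32, 18.56),        # longitude of Ceres
    -dms(0, 59, 34.06),        # latitude of Ceres (seconds assumed)
    dms(342, 54, 56.00),       # longitude of the Earth (seconds assumed)
    0.0031514)                 # log of the Sun's distance
```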

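III.

Equations of the first hypothesis.

When the preceding quantities have been computed, their numerical
values (or their logarithms, when more convenient for computation,)
are to be substituted in the following equations:

Components of $\mathfrak{S}_1$ (equations III$_1$):

$$q_1 = \rho_1 + (\mathfrak{E}_1\cdot\mathfrak{F}_1), \qquad r_1^2 = q_1^2 + p_1^2, \qquad R_1 = \frac{B_1}{r_1^3},$$
$$\alpha_1 = A_1\xi_1(1 + R_1)\Big(q_1 + \frac{X_1}{\xi_1} - (\mathfrak{E}_1\cdot\mathfrak{F}_1)\Big),$$
$$\beta_1 = A_1\eta_1(1 + R_1)\Big(q_1 + \frac{Y_1}{\eta_1} - (\mathfrak{E}_1\cdot\mathfrak{F}_1)\Big),$$
$$\gamma_1 = A_1\zeta_1(1 + R_1)\Big(q_1 + \frac{Z_1}{\zeta_1} - (\mathfrak{E}_1\cdot\mathfrak{F}_1)\Big).$$

For control: $s_1 = \sqrt{\alpha_1^2 + \beta_1^2 + \gamma_1^2} = A_1(1 + R_1)\,r_1$.

Components of $\mathfrak{S}'$ (equations III'):

$$P' = \frac{3R_1q_1}{(1 + R_1)\,r_1^2},$$
$$\alpha' = A_1\xi_1 + A_1\xi_1R_1 - P'\alpha_1, \qquad
\beta' = A_1\eta_1 + A_1\eta_1R_1 - P'\beta_1, \qquad
\gamma' = A_1\zeta_1 + A_1\zeta_1R_1 - P'\gamma_1.$$

Components of $\mathfrak{S}_2$ (equations III$_2$):

$$q_2 = \rho_2 + (\mathfrak{E}_2\cdot\mathfrak{F}_2), \qquad r_2^2 = q_2^2 + p_2^2, \qquad R_2 = \frac{B_2}{r_2^3},$$
$$\alpha_2 = -\xi_2(1 - R_2)\Big(q_2 + \frac{X_2}{\xi_2} - (\mathfrak{E}_2\cdot\mathfrak{F}_2)\Big),$$
$$\beta_2 = -\eta_2(1 - R_2)\Big(q_2 + \frac{Y_2}{\eta_2} - (\mathfrak{E}_2\cdot\mathfrak{F}_2)\Big),$$
$$\gamma_2 = -\zeta_2(1 - R_2)\Big(q_2 + \frac{Z_2}{\zeta_2} - (\mathfrak{E}_2\cdot\mathfrak{F}_2)\Big).$$

For control: $s_2 = \sqrt{\alpha_2^2 + \beta_2^2 + \gamma_2^2} = (1 - R_2)\,r_2$.

Components of $\mathfrak{S}''$ (equations III''):

$$P'' = \frac{3R_2q_2}{(1 - R_2)\,r_2^2},$$
$$\alpha'' = -\xi_2 + \xi_2R_2 + P''\alpha_2, \qquad
\beta'' = -\eta_2 + \eta_2R_2 + P''\beta_2, \qquad
\gamma'' = -\zeta_2 + \zeta_2R_2 + P''\gamma_2.$$

Components of $\mathfrak{S}_3$ (equations III$_3$):

$$q_3 = \rho_3 + (\mathfrak{E}_3\cdot\mathfrak{F}_3), \qquad r_3^2 = q_3^2 + p_3^2, \qquad R_3 = \frac{B_3}{r_3^3},$$
$$\alpha_3 = A_3\xi_3(1 + R_3)\Big(q_3 + \frac{X_3}{\xi_3} - (\mathfrak{E}_3\cdot\mathfrak{F}_3)\Big),$$
$$\beta_3 = A_3\eta_3(1 + R_3)\Big(q_3 + \frac{Y_3}{\eta_3} - (\mathfrak{E}_3\cdot\mathfrak{F}_3)\Big),$$
$$\gamma_3 = A_3\zeta_3(1 + R_3)\Big(q_3 + \frac{Z_3}{\zeta_3} - (\mathfrak{E}_3\cdot\mathfrak{F}_3)\Big).$$

For control: $s_3 = \sqrt{\alpha_3^2 + \beta_3^2 + \gamma_3^2} = A_3(1 + R_3)\,r_3$.

Components of $\mathfrak{S}'''$ (equations III'''):

$$P''' = \frac{3R_3q_3}{(1 + R_3)\,r_3^2},$$
$$\alpha''' = A_3\xi_3 + A_3\xi_3R_3 - P'''\alpha_3, \qquad
\beta''' = A_3\eta_3 + A_3\eta_3R_3 - P'''\beta_3, \qquad
\gamma''' = A_3\zeta_3 + A_3\zeta_3R_3 - P'''\gamma_3.$$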
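
As an illustration of equations III$_1$, the sketch below evaluates them for the fourth approximation of the worked example ($q_1 = 2.5142134$) using the logarithms printed under I and II. The function name and argument order are assumptions; the form of the equations is the one inferred from the numerical coefficients of the example.

```python
import math

def equations_III1(q1, A1, B1, p1_sq, EF1, earth, direction):
    r1 = math.sqrt(q1 * q1 + p1_sq)
    R1 = B1 / r1 ** 3
    comps = [A1 * d * (1 + R1) * (q1 + x / d - EF1)
             for x, d in zip(earth, direction)]
    return r1, R1, comps

A1 = 10 ** (9.6854897 - 10)
B1 = 10 ** (9.6692113 - 10)
earth1 = (10 ** (9.9835515 - 10), -10 ** (9.4711748 - 10), 0.0)     # X1, Y1, Z1
dir1 = (-10 ** (8.9845270 - 10), 10 ** (9.9979027 - 10), -10 ** (8.2387150 - 10))
r1, R1, (alpha1, beta1, gamma1) = equations_III1(
    2.5142134, A1, B1, 0.8645336, -0.3874081, earth1, dir1)
s1_from_components = math.sqrt(alpha1 ** 2 + beta1 ** 2 + gamma1 ** 2)
s1_from_control = A1 * (1 + R1) * r1
```

The last two values are the two sides of the control equation for $s_1$; they agree to the accuracy of the seven-figure inputs.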


The computer is now to assume any reasonable values either of the
geocentric distances, $\rho_1$, $\rho_2$, $\rho_3$, or of the heliocentric distances, $r_1$, $r_2$,
$r_3$ (the former in the case of a comet, the latter in the case of an
asteroid), and from these assumed values to compute the rest of the
following quantities:

By equations III_1, III'.    By equations III_2, III''.    By equations III_3, III'''.

      q_1                          q_2                           q_3
      log r_1                      log r_2                       log r_3
      log R_1                      log R_2                       log R_3
      log (1 + R_1)                log (1 - R_2)                 log (1 + R_3)
      log P'                       log P''                       log P'''
      α_1                          α_2                           α_3
      β_1                          β_2                           β_3
      γ_1                          γ_2                           γ_3
      α'                           α''                           α'''
      β'                           β''                           β'''
      γ'                           γ''                           γ'''


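IV.

Calculations relating to differential coefficients.

Components of $\mathfrak{S}''\times\mathfrak{S}'''$:
$$a_1 = \beta''\gamma''' - \gamma''\beta''', \qquad b_1 = \gamma''\alpha''' - \alpha''\gamma''', \qquad c_1 = \alpha''\beta''' - \beta''\alpha'''.$$

Components of $\mathfrak{S}'''\times\mathfrak{S}'$:
$$a_2 = \beta'''\gamma' - \gamma'''\beta', \qquad b_2 = \gamma'''\alpha' - \alpha'''\gamma', \qquad c_2 = \alpha'''\beta' - \beta'''\alpha'.$$

Components of $\mathfrak{S}'\times\mathfrak{S}''$:
$$a_3 = \beta'\gamma'' - \gamma'\beta'', \qquad b_3 = \gamma'\alpha'' - \alpha'\gamma'', \qquad c_3 = \alpha'\beta'' - \beta'\alpha''.$$

$$G = a_1\alpha' + b_1\beta' + c_1\gamma' = a_2\alpha'' + b_2\beta'' + c_2\gamma'' = a_3\alpha''' + b_3\beta''' + c_3\gamma'''.$$

These computations are controlled by the agreement of the three
values of G.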
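
The computations of §IV, together with the corrections $C_1$, $C_2$, $C_3$ of §V, can be sketched with the first-column numbers of the worked example. The formula $C_i = -(a_i\alpha + b_i\beta + c_i\gamma)/G$ is the form inferred from those numbers; variable names are illustrative.

```python
def cross(u, v):
    return (u[1] * v[2] - u[2] * v[1],
            u[2] * v[0] - u[0] * v[2],
            u[0] * v[1] - u[1] * v[0])

def dot(u, v):
    return sum(a * b for a, b in zip(u, v))

# First-column values of the example: components of S', S'', S'''.
Sp = (-0.050506, 0.47139, -0.00818)
Spp = (0.20182, -1.08177, -0.13464)
Sppp = (-0.22847, 0.44441, 0.06690)

abc1 = cross(Spp, Sppp)     # a1, b1, c1
abc2 = cross(Sppp, Sp)      # a2, b2, c2
abc3 = cross(Sp, Spp)       # a3, b3, c3
G = (dot(abc1, Sp), dot(abc2, Spp), dot(abc3, Sppp))   # three values of G

residuals = (-0.06058, -0.16692, -0.05557)             # alpha, beta, gamma
C = tuple(-dot(abc, residuals) / G[0] for abc in (abc1, abc2, abc3))
```

The three values of G are the same triple product taken in cyclic order, which is why their agreement controls the work; the corrections come out close to the printed $-.65888$, $-.76983$, $-.79939$.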

The following are not necessary except when the corrections to be
made are large:

$$H = (\mathfrak{F}_2\,\mathfrak{S}'''\,\mathfrak{S}') = a_2\xi_2 + b_2\eta_2 + c_2\zeta_2$$
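V.

Corrections of the geocentric distances.

Components of $\mathfrak{S}$:

$$\alpha = \alpha_1 + \alpha_2 + \alpha_3, \qquad \beta = \beta_1 + \beta_2 + \beta_3, \qquad \gamma = \gamma_1 + \gamma_2 + \gamma_3,$$

$$C_1 = -\frac{a_1\alpha + b_1\beta + c_1\gamma}{G}, \qquad
C_2 = -\frac{a_2\alpha + b_2\beta + c_2\gamma}{G}, \qquad
C_3 = -\frac{a_3\alpha + b_3\beta + c_3\gamma}{G},$$

$$\Delta q_2 = C_2 - .6L(\Delta q_2)^2.$$

(This equation will generally be most easily solved by repeated
substitutions.)

$$\Delta q_1 = C_1 - .6L(\Delta q_2)^2, \qquad \Delta q_3 = C_3 - .6L(\Delta q_2)^2.$$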

VI.

Successive corrections.

$\Delta q_1$, $\Delta q_2$, $\Delta q_3$ are to be added as corrections to $q_1$, $q_2$, $q_3$. With the
new values thus obtained the computation by equations III$_1$, III$_2$, III$_3$
is to be recommenced. Two courses are now open:

(a) The work may be carried on exactly as before to the determination
of new corrections for $q_1$, $q_2$, $q_3$.

(b) The computations by equations III', III'', III''', and IV may be
omitted, and the old values of $a_1$, $b_1$, $c_1$, $a_2$, etc., $G$, and $L$ may be
used with the new residuals $\alpha$, $\beta$, $\gamma$ to get new corrections for $q_1$, $q_2$,
$q_3$ by the equations

A ^2 

$$\Delta q_1 = C_1 + \Delta q_2 - C_2, \qquad \Delta q_3 = C_3 + \Delta q_2 - C_2,$$

where $Dq_2$ denotes the former correction of $q_2$. (More generally, at
any stage of the work, $Dq_2$ will represent the sum of all the corrections
of $q_2$ which have been made since the last computation of $a_1$, etc.)

So far as any general rule can be given, it is advised to recompute
$a_1$, $b_1$, etc., and $G$ once, perhaps after the second corrections of $q_1$, $q_2$,
$q_3$, unless the assumed values represent a fair approximation. Whether
$L$ is also to be recomputed, depends on its magnitude, and on that of
the correction of $q_2$, which remains to be made. In the later stages
of the work, when the corrections are small, the terms containing $L$
may be neglected altogether.

The corrections of $q_1$, $q_2$, $q_3$ should be repeated until the equations
$$\alpha = 0, \qquad \beta = 0, \qquad \gamma = 0$$
are nearly satisfied. Approximate values of $r_1$, $r_2$, $r_3$ may suffice for
the following computations, which, however, must be made with the
greatest exactness.

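VII.

Test of the first hypothesis.

log $r_1$, log $r_2$, log $r_3$ (approximate values from the preceding
computations).

$$N = A_1B_1r_1^{-3} + B_2r_2^{-3} + A_3B_3r_3^{-3}$$
$$s_1 = A_1r_1 + A_1B_1r_1^{-2}, \qquad s_2 = r_2 - B_2r_2^{-2}, \qquad s_3 = A_3r_3 + A_3B_3r_3^{-2}$$
$$s = \tfrac12(s_1 + s_2 + s_3), \qquad s - s_1, \qquad s - s_2, \qquad s - s_3.$$

The value of $s - s_2$ may be very small, and its logarithm in consequence
ill determined. This will do no harm if the computer is
careful to use the same value (computed, of course, as carefully as
possible) wherever the expression occurs in the following formulae:

$$R = \sqrt{\frac{(s - s_1)(s - s_2)(s - s_3)}{s}}, \qquad p = \frac{2(s - s_2)}{N},$$
$$\tan\tfrac12(v_2 - v_1) = \frac{R}{s - s_3}, \qquad
\tan\tfrac12(v_3 - v_2) = \frac{R}{s - s_1}, \qquad
\tan\tfrac12(v_3 - v_1) = \frac{s - s_2}{R}.$$

For adjustment of values: $\tfrac12(v_3 - v_1) = \tfrac12(v_2 - v_1) + \tfrac12(v_3 - v_2)$.

$$e\sin\tfrac12(v_3 + v_1) = \frac{\dfrac{p}{r_1} - \dfrac{p}{r_3}}{2\sin\tfrac12(v_3 - v_1)}, \qquad
e\cos\tfrac12(v_3 + v_1) = \frac{\dfrac{p}{r_1} + \dfrac{p}{r_3} - 2}{2\cos\tfrac12(v_3 - v_1)}, \qquad
\tan\tfrac12(v_3 + v_1).$$

For control: $e\cos v_2 = \dfrac{p}{r_2} - 1$.

$$\epsilon = \sqrt{\frac{1 - e}{1 + e}}, \qquad a = \frac{p}{1 - e^2},$$
$$\tan\tfrac12E_1 = \epsilon\tan\tfrac12v_1, \qquad \tan\tfrac12E_2 = \epsilon\tan\tfrac12v_2, \qquad \tan\tfrac12E_3 = \epsilon\tan\tfrac12v_3,$$
$$\tau_{1\,\mathrm{calc.}} = a^{3/2}(E_3 - E_2) + ea^{3/2}\sin E_2 - ea^{3/2}\sin E_3,$$
$$\tau_{3\,\mathrm{calc.}} = a^{3/2}(E_2 - E_1) + ea^{3/2}\sin E_1 - ea^{3/2}\sin E_2.$$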
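VIII.

For the second hypothesis.

$$\delta\tau_1 = .0057613\,k(\rho_2 - \rho_3) \qquad \text{(aberration-constant after Struve.)}$$
$$\delta\tau_3 = .0057613\,k(\rho_1 - \rho_2) \qquad \log(.0057613\,k) = 5.99610$$

$$\Delta\log\tau_1 = \log\tau_1 - \log(\tau_{1\,\mathrm{calc.}} - \delta\tau_1)$$
$$\Delta\log\tau_3 = \log\tau_3 - \log(\tau_{3\,\mathrm{calc.}} - \delta\tau_3)$$
$$\Delta\log(\tau_1\tau_3) = \Delta\log\tau_1 + \Delta\log\tau_3$$
$$\Delta\log\frac{\tau_1}{\tau_3} = \Delta\log\tau_1 - \Delta\log\tau_3$$
$$\Delta\log A_1 = A_3\,\Delta\log\frac{\tau_1}{\tau_3}, \qquad \Delta\log A_3 = -A_1\,\Delta\log\frac{\tau_1}{\tau_3}$$
$$\Delta\log B_1 = \Delta\log(\tau_1\tau_3) - \frac{\tau_1^2 + \tau_3^2}{12B_1}\,\Delta\log\frac{\tau_1}{\tau_3}$$
$$\Delta\log B_2 = \Delta\log(\tau_1\tau_3) + \frac{\tau_1^2 - \tau_3^2}{12B_2}\,\Delta\log\frac{\tau_1}{\tau_3}$$
$$\Delta\log B_3 = \Delta\log(\tau_1\tau_3) + \frac{\tau_1^2 + \tau_3^2}{12B_3}\,\Delta\log\frac{\tau_1}{\tau_3}$$

These corrections are to be added to the logarithms of $A_1$, $A_3$, $B_1$,
$B_2$, $B_3$, in equations III$_1$, III$_2$, III$_3$, and the corrected equations
used to correct the values of $q_1$, $q_2$, $q_3$, until the residuals $\alpha$, $\beta$, $\gamma$
vanish. The new values of $A_1$, $A_3$ must satisfy the relation
$A_1 + A_3 = 1$, and the corrections $\Delta\log A_1$, $\Delta\log A_3$ must be adjusted,
if necessary, for this end.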
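
The corrections of §VIII follow from differentiating the definitions of $A_1$, $A_3$, $B_1$, $B_2$, $B_3$ in §I. The sketch below applies them to the worked example, where $\Delta\log\tau_1 = -.0002365$ and $\Delta\log\tau_3 = -.0002416$; the function name and the returned dictionary are illustrative.

```python
def second_hypothesis_corrections(tau1, tau3, dlog_tau1, dlog_tau3):
    A1, A3 = tau1 / (tau1 + tau3), tau3 / (tau1 + tau3)
    B1 = (-tau1 ** 2 + tau1 * tau3 + tau3 ** 2) / 12
    B2 = (tau1 ** 2 + 3 * tau1 * tau3 + tau3 ** 2) / 12
    B3 = (tau1 ** 2 + tau1 * tau3 - tau3 ** 2) / 12
    prod = dlog_tau1 + dlog_tau3          # correction of log(tau1*tau3)
    ratio = dlog_tau1 - dlog_tau3         # correction of log(tau1/tau3)
    return {
        "A1": A3 * ratio,
        "A3": -A1 * ratio,
        "B1": prod - (tau1 ** 2 + tau3 ** 2) / (12 * B1) * ratio,
        "B2": prod + (tau1 ** 2 - tau3 ** 2) / (12 * B2) * ratio,
        "B3": prod + (tau1 ** 2 + tau3 ** 2) / (12 * B3) * ratio,
    }

# log tau1 = .3358520 and log tau3 = .3624066 in the example.
corr = second_hypothesis_corrections(10 ** 0.3358520, 10 ** 0.3624066,
                                     -0.0002365, -0.0002416)
```

The results reproduce the corrections printed under VIII of the example to the seventh decimal.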

Third hypothesis.

A second correction of equations III$_1$, III$_2$, III$_3$ may be obtained in
the same manner as the first, but this will rarely be necessary.

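IX.

Determination of the ellipse.

It is supposed that the values of

$\alpha_1$, $\beta_1$, $\gamma_1$, $\alpha_2$, $\beta_2$, $\gamma_2$, $\alpha_3$, $\beta_3$, $\gamma_3$,

$r_1$, $r_2$, $r_3$, $R_1$, $R_2$, $R_3$, $s_1$, $s_2$, $s_3$,

have been computed by equations III$_1$, III$_2$, III$_3$ with the greatest
exactness, so as to make the residuals $\alpha$, $\beta$, $\gamma$ vanish, and that the
two formulae for each of the quantities $s_1$, $s_2$, $s_3$ give sensibly the same
value.

Components of $\mathfrak{S}_4$:

$$\alpha_4 = \alpha_3 - \alpha_1, \qquad \beta_4 = \beta_3 - \beta_1, \qquad \gamma_4 = \gamma_3 - \gamma_1.$$

Components of $\mathfrak{S}_5$:

$$\alpha_5 = \alpha_4 - \frac{\alpha_4\alpha_2 + \beta_4\beta_2 + \gamma_4\gamma_2}{s_2^2}\,\alpha_2, \qquad
\beta_5 = \beta_4 - \frac{\alpha_4\alpha_2 + \beta_4\beta_2 + \gamma_4\gamma_2}{s_2^2}\,\beta_2, \qquad
\gamma_5 = \gamma_4 - \frac{\alpha_4\alpha_2 + \beta_4\beta_2 + \gamma_4\gamma_2}{s_2^2}\,\gamma_2.$$

$$s_5^2 = \alpha_5^2 + \beta_5^2 + \gamma_5^2$$

$$s = \tfrac12(s_1 + s_2 + s_3), \qquad S = \tfrac12(s_1 + s_4 + s_3)$$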


For control only; 


8-8 

* «(«-«,)(»-S 3 ) 




•AT— 

p= _ 2 r* 1 ._ 

^ Jf(8-8,)(8-3,) 


tan K^'s—^ 2 ) = ^ 


R 


-»1 


tan J(v,-t)j)= 


_ RS 


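The computer should be careful to use the corrected values of $A_1$, $A_3$.
(See VIII.) Trifling errors in the angles should be distributed.

$$e\sin\tfrac12(v_3 + v_1) = \frac{\dfrac{p}{r_1} - \dfrac{p}{r_3}}{2\sin\tfrac12(v_3 - v_1)}, \qquad
e\cos\tfrac12(v_3 + v_1) = \frac{\dfrac{p}{r_1} + \dfrac{p}{r_3} - 2}{2\cos\tfrac12(v_3 - v_1)}, \qquad
\tan\tfrac12(v_3 + v_1).$$

For control: $e\cos v_2 = \dfrac{p}{r_2} - 1$.

$$\epsilon = \sqrt{\frac{1 - e}{1 + e}}, \qquad a = \frac{p}{1 - e^2}, \qquad b = a\sqrt{1 - e^2}.$$

Direction-cosines of semi-major axis.

$$l = -\cos v_2\,\frac{\alpha_2}{s_2} - \sin v_2\,\frac{\alpha_5}{s_5}, \qquad
m = -\cos v_2\,\frac{\beta_2}{s_2} - \sin v_2\,\frac{\beta_5}{s_5}, \qquad
n = -\cos v_2\,\frac{\gamma_2}{s_2} - \sin v_2\,\frac{\gamma_5}{s_5}.$$

Direction-cosines of semi-minor axis.

$$\lambda = -\sin v_2\,\frac{\alpha_2}{s_2} + \cos v_2\,\frac{\alpha_5}{s_5}, \qquad
\mu = -\sin v_2\,\frac{\beta_2}{s_2} + \cos v_2\,\frac{\beta_5}{s_5}, \qquad
\nu = -\sin v_2\,\frac{\gamma_2}{s_2} + \cos v_2\,\frac{\gamma_5}{s_5}.$$

Components of the semi-axes.

$$a_x = al, \qquad a_y = am, \qquad a_z = an,$$
$$b_x = b\lambda, \qquad b_y = b\mu, \qquad b_z = b\nu.$$
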

X.

Time of perihelion passage.

Corrections for aberration.

$$\tan\tfrac12E_1 = \epsilon\tan\tfrac12v_1, \qquad \delta t_1 = -.0057613\,\rho_1$$
$$\tan\tfrac12E_2 = \epsilon\tan\tfrac12v_2, \qquad \delta t_2 = -.0057613\,\rho_2$$
$$\tan\tfrac12E_3 = \epsilon\tan\tfrac12v_3, \qquad \delta t_3 = -.0057613\,\rho_3$$

log .0057613 = 7.76052

$$t_1 + \delta t_1 - T = k^{-1}a^{3/2}(E_1 - e\sin E_1)$$
$$t_2 + \delta t_2 - T = k^{-1}a^{3/2}(E_2 - e\sin E_2)$$
$$t_3 + \delta t_3 - T = k^{-1}a^{3/2}(E_3 - e\sin E_3)$$

The threefold determination of $T$ affords a control of the exactness
of the solution of the problem. If the discrepancies in the values of $T$
are such as to require another correction of the formulae (a third
hypothesis), this may be based on the equations

$$\Delta\log\tau_1 = M\,\frac{T_{(3)} - T_{(2)}}{t_3 - t_2}, \qquad \Delta\log\tau_3 = M\,\frac{T_{(2)} - T_{(1)}}{t_2 - t_1},$$

where $T_{(1)}$, $T_{(2)}$, $T_{(3)}$ denote respectively the values obtained from the
first, second, and third observations, and $M$ the modulus of common
logarithms.

XI.

For an ephemeris.

$$\frac{k}{a^{3/2}}\,(t - T) = E - e\sin E$$

Heliocentric coordinates. (Components of $\mathfrak{R}$.)

$$x = -ea_x + a_x\cos E + b_x\sin E$$
$$y = -ea_y + a_y\cos E + b_y\sin E$$
$$z = -ea_z + a_z\cos E + b_z\sin E$$

These equations are completely controlled by the agreement of the
computed and observed positions and the following relations between
the constants:

$$a_xb_x + a_yb_y + a_zb_z = 0, \qquad b_x^2 + b_y^2 + b_z^2 = (1 - e^2)(a_x^2 + a_y^2 + a_z^2).$$
XII.

When an approximate orbit is known in advance, we may use it to
improve our fundamental equation. The following appears to be the
most simple method:

Find the eccentric anomalies $E_1$, $E_2$, $E_3$, and the heliocentric
distances $r_1$, $r_2$, $r_3$, which belong in the approximate orbit to the times
of observation corrected for aberration.

Calculate $B_1$, $B_3$ as in §I, using these corrected times.

Determine $A_1$, $A_3$ by the equation

$$\frac{A_1(1 + B_1r_1^{-3})}{\sin(E_3 - E_2) - e\sin E_3 + e\sin E_2} = \frac{A_3(1 + B_3r_3^{-3})}{\sin(E_2 - E_1) - e\sin E_2 + e\sin E_1}$$

in connection with the relation $A_1 + A_3 = 1$.

Determine $B_2$ so as to make

$$\frac{A_1B_1r_1^{-3} + B_2r_2^{-3} + A_3B_3r_3^{-3}}{4\sin\tfrac12(E_2 - E_1)\,\sin\tfrac12(E_3 - E_2)\,\sin\tfrac12(E_3 - E_1)}$$

equal to either member of the last equation.

It is not necessary that the times for which $E_1$, $E_2$, $E_3$, $r_1$, $r_2$, $r_3$
are calculated should precisely agree with the times of observation
corrected for aberration. Let the former be represented by $t_1'$, $t_2'$, $t_3'$
and the latter by $t_1''$, $t_2''$, $t_3''$; and let

$$\Delta\log\tau_1 = \log(t_3'' - t_2'') - \log(t_3' - t_2'),$$
$$\Delta\log\tau_3 = \log(t_2'' - t_1'') - \log(t_2' - t_1').$$

We may find $B_1$, $B_3$, $A_1$, $A_3$, $B_2$, as above, using $t_1'$, $t_2'$, $t_3'$, and then use
$\Delta\log\tau_1$, $\Delta\log\tau_3$ to correct their values, as in §VIII.


Numerical Example. 

To illustrate the numerical computations we have chosen the 
following example, both on account of the large heliocentric motion, 
and because Gauss and Oppolzer have treated the same data by their 
different methods. 

The data are taken from the Theoria Motus, §159, viz.,

Times, 1805, September         5.51336          139.42711         265.39813
Longitudes of Ceres           95° 32' 18".56    99° 49'  5".87    118°  5' 28".85
Latitudes of Ceres            -0° 59' 34".06    +7° 16' 36".80    +7° 38' 49".39
Longitudes of the Earth      342° 54' 56".00   117° 12' 43".25    241° 58' 50".71
Logs of the Sun's distance     0.0031514         9.9929861         0.0056974

The positions of Ceres have been freed from the effects of parallax
and aberration.
I.

From the given times we obtain the following values:

                     Numbers.       Logarithms.
  t_2 - t_1         133.91375        2.1268252
  t_3 - t_2         125.97102        2.1002706
  t_3 - t_1         259.88477        2.4147809
  A_1                  .4847187      9.6854897
  A_3                  .5152812      9.7120443
  τ_1                                 .3358520
  τ_3                                 .3624066
  B_1                                9.6692113
  B_2                                 .3183722
  B_3                                9.5623916

Control:     A_1 B_1 + B_2 + A_3 B_3 = 2.4959086
                           ½ τ_1 τ_3 = 2.4959081


II.

From the given positions we get:

                First observation.    Second observation.    Third observation.
  log X            9.9835515  +          9.6531725  -           9.6775810  -
  log Y            9.4711748  -          9.9420444  +           9.9515547  -
  Z                0                     0                      0
  log ξ            8.9845270  -          9.2282738  -           9.6690294  -
  log η            9.9979027  +          9.9900800  +           9.9416855  +
  log ζ            8.2387150  -          9.1026549  +           9.1240813  +
  (𝔈·𝔉)             .3874081  -           .9314223  +            .5599304  -
  p²                .8645336  +           .1006681  +            .7130624  +



III.

The preceding computations furnish the numerical values for the
equations III$_1$, III', III$_2$, III'', III$_3$, III''', which follow. Brackets
indicate that logarithms have been substituted for numbers.

We have now to assume some values for the heliocentric distances
$r_1$, $r_2$, $r_3$. A mean proportional between the mean distances of Mars
and Jupiter from the Sun suggests itself as a reasonable assumption.
In order, however, to test the convergence of the computations, when
the assumptions are not happy, we will make the much less probable
assumption (actually much farther from the truth) that the heliocentric
distances are an arithmetical mean between the distances of
Mars and Jupiter. This gives .526 for the logarithm of each of the
distances $r_1$, $r_2$, $r_3$. From these assumed values we compute the first
column of numbers in the three following tables:
Equations III_1:
$$q_1 = \rho_1 - .3874081, \qquad r_1^2 = q_1^2 + .8645336, \qquad R_1 = [9.6692113]\,r_1^{-3},$$
$$\alpha_1 = -[8.6700167](q_1 - 9.5901555)(1 + R_1),$$
$$\beta_1 = [9.6833924](q_1 + .0900552)(1 + R_1),$$
$$\gamma_1 = -[7.9242047](q_1 + .3874081)(1 + R_1).$$

Equations III':
$$P' = \frac{[.47712]\,R_1q_1}{(1 + R_1)\,r_1^2},$$
$$\alpha' = -.046775 - [8.67002]R_1 - P'\alpha_1,$$
$$\beta' = .482383 + [9.68339]R_1 - P'\beta_1,$$
$$\gamma' = -.008399 - [7.92420]R_1 - P'\gamma_1.$$

  Δq_1                            - .66731     - .04558     + .0010434   + .0000006
  q_1           +    3.22606     2.55875      2.51317      2.5142134    2.5142140
  log r_1       +     .52600      .434960      .4280791     .4282376     .4282377
  log R_1       +    8.09121     8.364331     8.3849740    8.3844986
  log (1+R_1)   +     .00533      .009934      .0104122     .0104010
  log P'        +    8.01967     8.369628     8.3957468    8.3951457
  α_1           +     .30136      .336506      .3390605     .3390018
  β_1           +    1.61938     1.307304     1.2862230    1.2867056
  γ_1           -     .03072      .025316      .0249518     .0249601
  α'            -     .050506                  .0563438
  β'            +     .47139                   .4620942
  γ'            -     .00818                   .0079821



Equations III_2:
$$q_2 = \rho_2 + .9314223, \qquad r_2^2 = q_2^2 + .1006681, \qquad R_2 = [0.3183722]\,r_2^{-3},$$
$$\alpha_2 = +[9.2282738](q_2 + 1.7286820)(1 - R_2),$$
$$\beta_2 = -[9.9900800](q_2 - .0361309)(1 - R_2),$$
$$\gamma_2 = -[9.1026549](q_2 - .9314223)(1 - R_2).$$

Equations III'':
$$P'' = \frac{[.47712]\,R_2q_2}{(1 - R_2)\,r_2^2},$$
$$\alpha'' = .169151 - [9.22827]R_2 + P''\alpha_2,$$
$$\beta'' = -.977417 + [9.99008]R_2 + P''\beta_2,$$
$$\gamma'' = -.126664 + [9.10265]R_2 + P''\gamma_2.$$

  Δq_2                            - .77826     + .005042    + .0013222   + .0000021
  q_2           +    3.34235     2.56409      2.569132     2.5704542    2.5704563
  log r_2       +     .52600      .412233      .4130733     .4132934     .4132937
  log R_2       +    8.74037     9.081673     9.0791524    9.0784920
  log (1-R_2)   +    9.97543     9.944142     9.9444866    9.9445766
  log P''       +    8.71411     9.199120     9.1954270    9.1944598
  α_2           +     .81059      .638489      .6397466     .6400760
  β_2           -    3.05379     2.172660     2.1787230    2.1803116
  γ_2           -     .28858      .181843      .1825486     .1827338
  α''           +     .20182                   .2491864
  β''           -    1.08177                  1.2018221
  γ''           -     .13464                   .1400944


Equations III_3:
$$q_3 = \rho_3 - .5599304, \qquad r_3^2 = q_3^2 + .7130624, \qquad R_3 = [9.5623916]\,r_3^{-3},$$
$$\alpha_3 = -[9.3810737](q_3 + 1.5798163)(1 + R_3),$$
$$\beta_3 = [9.6537308](q_3 - .4630521)(1 + R_3),$$
$$\gamma_3 = [8.8361256](q_3 + .5599304)(1 + R_3).$$

Equations III''':
$$P''' = \frac{[.47712]\,R_3q_3}{(1 + R_3)\,r_3^2},$$
$$\alpha''' = -.240477 - [9.38107]R_3 - P'''\alpha_3,$$
$$\beta''' = +.450537 + [9.65373]R_3 - P'''\beta_3,$$
$$\gamma''' = +.068569 + [8.83613]R_3 - P'''\gamma_3.$$

  Δq_3                            - .80780     - .04055     + .0025316   + .0000031
  q_3           +    3.24945     2.44165      2.40110      2.4036316    2.4036347
  log r_3       +     .52600      .412217      .4057319     .4061394     .4061399
  log R_3       +    7.98439     8.325742     8.3451948    8.3439733
  log (1+R_3)   +     .00417      .009099      .0095108     .0094843
  log P'''      +    7.91715     8.357016     8.3817516    8.3801993
  α_3           -    1.17253      .987590      .9785152     .9790776
  β_3           +    1.26749      .910305      .8924956     .8936069
  γ_3           +     .26373      .210171      .2075292     .2076940
  α'''          -     .22847                   .2222335
  β'''          +     .44441                   .4390163
  γ'''          +     .06690                   .0650888


IV.

The values of $\alpha'$, $\beta'$, etc., furnish the basis for the computation of
the following quantities:

   a_1 = -.01254      a_2 = -.03517      a_3 = -.07232
   b_1 = +.01726      b_2 = -.00525      b_3 = -.00845
   c_1 = -.15746      c_2 = -.08526      c_3 = -.04050

For G we get three values sensibly identical. Adopting the mean,
we set

   G = .01006.

We also get H = -.00998, L = .02322.*


V.

Taking the values of $\alpha_1$, $\beta_1$, etc., from the columns under III$_1$, III$_2$,
III$_3$, we form the residuals

   α = -.06058,   β = -.16692,   γ = -.05557.

From these, with the numbers last computed, we get

   C_1 = -.65888,   C_2 = -.76983,   C_3 = -.79939,

which might be used as corrections for our values of $q_1$, $q_2$, $q_3$. To
get more accurate values for these corrections we set

   Δq_2 = C_2 - .6L(Δq_2)²,   or   Δq_2 = -.76983 - .01393(Δq_2)²,

which gives Δq_2 = -.77826.

The quadratic term diminishes the value of Δq_2 by .00843. Subtracting
the same quantity from C_1 and C_3 we get

   Δq_1 = -.66731,   Δq_3 = -.80780.
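
The "repeated substitutions" by which the quadratic for $\Delta q_2$ is solved can be sketched as follows, with the two numbers of the example ($C_2 = -.76983$ and the coefficient $.01393$). The function name and the number of steps are arbitrary choices.

```python
def solve_by_substitution(C2, coeff, steps=20):
    """Solve dq = C2 - coeff*dq**2 by repeated substitution, starting from C2."""
    dq = C2
    for _ in range(steps):
        dq = C2 - coeff * dq * dq
    return dq

dq2 = solve_by_substitution(-0.76983, 0.01393)
quadratic_term = dq2 - (-0.76983)      # amount by which the term changes C2
```

Since the coefficient is small, each substitution gains about two figures, so two or three repetitions are all that a computer working by hand would need.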

* It would have been better to omit altogether the calculation of H and L, if the
small value of the latter could have been foreseen. In fact, it will be found that
the terms containing L hardly improve the convergence, being smaller than quantities
which have been neglected. Nevertheless, the use of these terms in this example will
illustrate a process which in other cases may be beneficial.
VI.

Applying these corrections to the values of $q_1$, $q_2$, $q_3$, we compute
the second numerical columns under equations III$_1$, III$_2$, and III$_3$.
We do not go on to the computations by equations III', etc., but
content ourselves with the old values of $a_1$, $b_1$, etc., G, and L, which
with the new residuals

   α = -.012595,   β = .044949,   γ = .003012,

give   C_1 = -.04567,   C_2 = .004952,   C_3 = -.04064.

^^2 = ^' 2 - L{I)q,+ Ag. = *004952 - *02322( - 77826 + *00247) A^,. 

This gives Δq_2 = .005042.

As the term containing L has increased the value of Δq_2 by .00009,
we add this quantity to C_1 and C_3 and get

   Δq_1 = -.04558,   Δq_3 = -.04055.

With these corrections we compute the third numerical columns
under equations III$_1$, etc. This time we recompute the quantities $\alpha'$,
$\beta'$, etc., with which we repeat the principal computations of IV, and get
the new values

-*0167215 - immry u.,- -*0743299 

   b_1 = +.0149145      b_2 = -.0054413      b_3 = -.0098825
   c_1 = -.1576886      c_2 = -.0779570      c_3 = -.0474318

   G = .0090929.

The quantities H and L we neglect as of no consequence at this
stage of the approximation.

With these values the new residuals

   α = +.0002919,   β = -.0000044,   γ = +.0000288,

give   Δq_1 = C_1 = +.0010434,   Δq_2 = C_2 = +.0013222,
       Δq_3 = C_3 = +.0025316.

These corrections furnish the basis for the fourth columns of
numbers under equations III$_1$, etc., which give the residuals

   α = +.0000002,   β = +.0000009,   γ = +.0000001,

and the new corrections

   Δq_1 = +.0000006,   Δq_2 = +.0000021,   Δq_3 = +.0000031.

The corrected values of $q_1$, $q_2$, $q_3$ give

   log r_1 = 0.4282377,   log r_2 = 0.4132937,   log r_3 = 0.4061399.

We have carried the approximation farther than is necessary for
the following correction of the formula, in order to see exactly where
the uncorrected formula would lead us, and for the control afforded
by the fourth residuals.
VII.

The computations for the test of the uncorrected formula (the first
hypothesis) are as follows:

                                     Number or arc.       Logarithm.
  log r_1                                                  0.4282377
  log r_2                                                  0.4132937
  log r_3                                                  0.4061399
  A_1 B_1 r_1^-3              +        .01174865           8.0699879
  B_2 r_2^-3                  +        .11980944           9.0784911
  A_3 B_3 r_3^-3              +        .01137670           8.0560162
  N                           +        .14293479           9.1551380
  s_1                         +       1.3308476            0.1241283
  s_2                         +       2.2796616            0.3578704
  s_3                         +       1.3417404            0.1276685
  s                           +       2.4761248            0.3937725
  s - s_1                     +       1.1452772            0.0589106
  s - s_2                     +       0.1964632            9.2932812
  s - s_3                     +       1.1343844            0.0547602
  R                           +                            9.5065898
  p                           +                            0.4391732
  tan ½(v_2 - v_1)            +      15° 48' 10".82        9.4518296
  tan ½(v_3 - v_2)            +      15° 39' 36".38        9.4476792
  tan ½(v_3 - v_1)            +      31° 27' 47".20        9.7866915
  e sin ½(v_3 + v_1)          -                            8.7099387
  e cos ½(v_3 + v_1)          +                            8.7872701
  tan ½(v_3 + v_1)            -     -39° 55' 32".31        9.9226686
  e                           +                            8.9025438
  ε                           +                            9.9652259
  a                           +                            0.4419546
  tan ½ v_1                   -     -35° 41' 39".75        9.8563809
  tan ½ v_2                   -     -19° 53' 28".93        9.5584981
  tan ½ v_3                   -      -4° 13' 52".55        8.8691380
  tan ½ E_1                   -     -33° 33'  0".17        9.8216068
  tan ½ E_2                   -     -18° 28'  6".35        9.5237240
  tan ½ E_3                   -      -3° 54' 24".21        8.8343639
  sin E_1                     -     -67°  6'  0".34        9.9643473
  sin E_2                     -     -36° 56' 12".70        9.7788272
  sin E_3                     -      -7° 48' 48".42        9.1333734
  e a^(3/2) sin E_1           -        .3387061            9.5298230
  e a^(3/2) sin E_2           -        .2209545            9.3443029
  e a^(3/2) sin E_3           -        .0499861            8.6988491
  a^(3/2) (E_2 - E_1)         +       2.4226307            0.3842872
  a^(3/2) (E_3 - E_2)         +       2.3391145            0.3690515
  τ_3 calc.                   +       2.3048791            0.3626482
  τ_1 calc.                   +       2.1681461            0.3360885

VIII.

The logarithms of the calculated values of the intervals of time
exceed those of the given values by .0002416 for the first interval ($\tau_3$)
and .0002365 for the second ($\tau_1$). Therefore, since the corrections for
aberration have been incorporated in the data, we set for the correction
of the formula (for the second hypothesis)

   Δ log τ_1 = -.0002365,   Δ log τ_3 = -.0002416.

This gives   Δ log A_1 = .0000026,   Δ log A_3 = -.0000025,
   Δ log B_1 = -.0004872,   Δ log B_2 = -.0004782,   Δ log B_3 = -.0004665.

The new values of the logarithms of $A_1$, $A_3$ are

   log A_1 = 9.6854923,   log A_3 = 9.7120418.

Applying these corrections to equations III$_1$, III$_2$, III$_3$,* we get the
following:

Equations III_1 corrected:
$$r_1^2 = q_1^2 + .8645336, \qquad R_1 = [9.6687241]\,r_1^{-3},$$
$$\alpha_1 = -[8.6700193](q_1 - 9.5901555)(1 + R_1),$$
$$\beta_1 = +[9.6833950](q_1 + .0900552)(1 + R_1),$$
$$\gamma_1 = -[7.9242073](q_1 + .3874081)(1 + R_1).$$

  Δq_1                             + .0002887    - .0000217
  q_1           +    2.5142140    2.5145027     2.5144810
  log r_1       +     .4282377     .4282816      .4282782
  log R_1       +    8.3838110    8.3838793     8.3838894
  log (1+R_1)   +     .0103847     .0103863      .0103865
  α_1           +     .3389910     .3389784      .3389796
  β_1           +    1.2866654    1.2868124     1.2868024
  γ_1           -     .0249593     .0249619      .0249617
  log s_1       +                                .1241571


Equations III_2 corrected:
$$r_2^2 = q_2^2 + .1006681, \qquad R_2 = [0.3178940]\,r_2^{-3},$$
$$\alpha_2 = +[9.2282738](q_2 + 1.7286820)(1 - R_2),$$
$$\beta_2 = -[9.9900800](q_2 - .0361309)(1 - R_2),$$
$$\gamma_2 = -[9.1026549](q_2 - .9314223)(1 - R_2).$$

  Δq_2                             - .0000955    + .0000187
  q_2           +    2.5704563    2.5703608     2.5703795
  log r_2       +     .4132937     .4132778      .4132809
  log R_2       +    9.0780129    9.0780605     9.0780513
  log (1-R_2)   +    9.9446418    9.9446353     9.9446365
  α_2           +     .6401725     .6401487      .6401532
  β_2           -    2.1806412    2.1805261     2.1805482
  γ_2           -     .1827615     .1827481      .1827507
  log s_2       +                                .3579174


* The corrections may be made without rewriting the equations.
Equations III_3 corrected:
$$r_3^2 = q_3^2 + .7130624, \qquad R_3 = [9.5619251]\,r_3^{-3},$$
$$\alpha_3 = -[9.3810712](q_3 + 1.5798163)(1 + R_3),$$
$$\beta_3 = +[9.6537283](q_3 - .4630521)(1 + R_3),$$
$$\gamma_3 = +[8.8361231](q_3 + .5599304)(1 + R_3).$$

  Δq_3                             + .0003302    + .0000424
  q_3           +    2.4036347    2.4039649     2.4040073
  log r_3       +     .4061399     .4061929      .4061998
  log R_3       +    8.3435055    8.3433463     8.3433257
  log (1+R_3)   +     .0094742     .0094708      .0094704
  α_3           -     .9790500     .9791236      .9791329
  β_3           +     .8935824     .8937277      .8937461
  γ_3           +     .2076882     .2077097      .2077124
  log s_3       +                                .1277120


With these corrected equations the last values of q₁, q₂, q₃ give
the residuals

    α = .0001135        β = −.0003934        γ = −.0000326

These give the corrections

    Δq₁ = .0002887      Δq₂ = −.0000955      Δq₃ = .0003302

The next residuals are

    α = .0000035        β = .0000140         γ = −.0000003

which give the corrections

    Δq₁ = −.0000217     Δq₂ = .0000187       Δq₃ = .0000424

The next residuals are

    α = −.0000001       β = .0000003         γ = .0000000

which must be regarded as entirely insensible.

IX, X.

It remains to determine the ellipse which passes through the points
to which the numbers relate in the last columns under the corrected
equations III₁, III₂, III₃, and also the time of perihelion passage.
The computations are as follows:




[Table of the computations, in paired columns headed "Number or arc." and "Logarithm."; the entries, printed sideways in the original, are not legible in this copy.]
XI.

This gives the following equations for an ephemeris:

        T = 1806, June 23.97450, Paris mean time
        [2.886…](t − T) = E″ − [4.2216270] sin E

Heliocentric coordinates relating to the ecliptic.

        x = +.1820700 − [0.3530366] cos E − [0.1827457] sin E
        y = −.1244685 + [0.1878576] cos E − [0.3603257] sin E
        z = −.0373970 + [9.6656346] cos E + [9.3320292] sin E

The differences of the values of T₍₁₎, T₍₂₎, T₍₃₎ from their mean T
indicate the residual errors of this hypothesis. They indicate
differences in the calculated and the observed geocentric positions which
are represented by the geocentric angles subtended by the path
described by the planet in the following fractions of a day: .00054,
.00008, .00052. Since the heliocentric motion of the planet is about
one-fourth of a degree per day, and the planet is considerably farther
from the earth than from the sun at the times of the first and third
observations, the errors will be less than half a second in arc.

If we desire all the accuracy possible with seven-figure logarithms,
we may form a third hypothesis based on the following corrections:

        Δ log τ₁ = … = −.0000017,

        Δ log τ₃ = … = −.0000018.

The equations for an ephemeris will then be:

        T = 1806, June 23.96878, Paris mean time
        [2.8863140](t − T) = E″ − [4.2216530] sin E

Heliocentric coordinates relating to the ecliptic.

        x = +.1820765 − [0.3530261] cos E − [0.1827783] sin E
        y = −.1244853 + [0.1878904] cos E − [0.3603153] sin E
        z = −.0373987 + [9.6656285] cos E + [9.3320758] sin E
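
The ephemeris equations of the third hypothesis can be evaluated as in the following minimal sketch. It assumes that each bracketed number is a common logarithm (with 10 added where the printed characteristic is 9), that the coefficient of t − T is the mean daily motion in seconds of arc, and that the coefficient of sin E is the eccentricity expressed in seconds of arc. Kepler's equation is solved here by Newton's iteration, which is a convenience of this sketch and not a procedure taken from the text. The function names and the sample value of t − T are illustrative only.

```python
import math

ARCSEC = math.pi / (180.0 * 3600.0)      # radians per second of arc

# Third-hypothesis constants, with the bracketed logarithms written out.
N = 10.0 ** 2.8863140 * ARCSEC           # mean daily motion, radians per day
ECC = 10.0 ** 4.2216530 * ARCSEC         # eccentricity (dimensionless)

# (constant term, coefficient of cos E, coefficient of sin E)
X = (+0.1820765, -10.0 ** 0.3530261, -10.0 ** 0.1827783)
Y = (-0.1244853, +10.0 ** 0.1878904, -10.0 ** 0.3603153)
Z = (-0.0373987, +10.0 ** (9.6656285 - 10), +10.0 ** (9.3320758 - 10))

def eccentric_anomaly(dt_days, tol=1e-14):
    """Solve N*dt = E - ECC*sin(E) for E (radians) by Newton's iteration."""
    mean_anomaly = N * dt_days
    E = mean_anomaly
    for _ in range(50):
        step = (E - ECC * math.sin(E) - mean_anomaly) / (1.0 - ECC * math.cos(E))
        E -= step
        if abs(step) < tol:
            break
    return E

def heliocentric(dt_days):
    """Heliocentric ecliptic coordinates x, y, z at dt_days = t - T."""
    E = eccentric_anomaly(dt_days)
    c, s = math.cos(E), math.sin(E)
    return tuple(k0 + kc * c + ks * s for k0, kc, ks in (X, Y, Z))

# Illustrative epoch: about 157.5 days before perihelion passage.
x, y, z = heliocentric(-157.54167)
r = math.sqrt(x * x + y * y + z * z)
print("x, y, z =", x, y, z)
print("log r   =", math.log10(r))
```

At t = T the radius vector reduces to a(1 − e), where a is the length of the vector formed by the three cosine coefficients, which gives a quick check on the constants.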

The agreement of the calculated geocentric positions with the data
is shown in the following table:


Times, 1805, September     5.51336           139.42711         265.39813

Second hypothesis:
    longitudes             95° 32′ 18″.88    99° 49′ 5″.87     118° 5′ 28″.52
    errors                 0″.32             0″.00             −0″.33
    latitudes              −0° 59′ 34″.01    7° 16′ 36″.82     7° 38′ 49″.34
    errors                 0″.05             0″.02             −0″.05

Third hypothesis:
    longitudes             95° 32′ 18″.65    99° 49′ 5″.82     118° 5′ 28″.79
    errors                 0″.09             −0″.05            −0″.06
    latitudes              −0° 59′ 34″.04    7° 16′ 36″.78     7° 38′ 49″.38
    errors                 0″.02             −0″.02            −0″.01

The immediate result of each hypothesis is to give three positions of
the planet, from which, with the times, the orbit may be calculated in
various ways, and with different results, so far as the positions deviate
from the truth on account of the approximate nature of the hypothesis.
In some respects, therefore, the correctness of an hypothesis is best
shown by the values of the geocentric or heliocentric distances which
are derived directly from it. The logarithms of the heliocentric
distances are brought together in the following table, and corresponding
values from Gauss* and Oppolzer† are added for comparison. It is
worthy of notice that the positions given by our second hypothesis are
substantially correct, and if the orbit had been calculated from the
first and third of these positions with the interval of time, it would
have left little to be desired.



                          log r₁       log r₂       log r₃

First hypothesis         .4282377     .4132937     .4061399
Second hypothesis        .4282782     .4132809     .4061998
Third hypothesis         .4282788     .4132808     …

Gauss:
First hypothesis         .4323934     .4114726     .4094712
Second hypothesis        .4291773     .4129371     .4071975
Third hypothesis         .4284841     .4132107     .4064697
Fourth hypothesis        .4282792     .4132817     .4062033

Oppolzer:
First hypothesis         .4281340     .4133…       .4061699
Second hypothesis        .4282794     .4132801     .4061976
Third hypothesis         .4282787     …            .4062009


In comparing the different methods, it should be observed that
the determination of the positions in any hypothesis by Gauss’s
method requires successive corrections of a single independent variable,
a corresponding determination by Oppolzer’s method requires the
successive corrections of two independent variables, while the
corresponding determination by the method of the present paper requires
the successive corrections of three independent variables.


* Theoria motus, § 159.

† Lehrbuch zur Bahnbestimmung der Kometen und Planeten, 2nd ed., vol. i, p. 394.





VI. 


ON THE USE OF THE VECTOR METHOD IN THE 
DETERMINATION OF ORBITS 

Letter to Dr. Hugo Buchholz, Editor of Klinkerfues’ Theoretische
Astronomie.*

New Haven, October, 1898. 

Dr. Hugo Buchholz, 

My dear Sir, The opinion of Fabritius† on the comparative
convenience of different methods is entitled to far more weight than
mine, for I am no astronomer, and have calculated very few orbits,
none, indeed, except for the trial of my own formulæ. The object of
my paper was to show to astronomers, who are rather conservative
(and with right, for astronomy is the oldest of the exact sciences),
the advantage in the use of vector notations, which I had learned in
Physics from Maxwell. This object could be best obtained, not by
showing, as I might have done, that much in the classic methods
could be conveniently and perspicuously represented by vector
notations, but rather by showing that these notations so simplify
the subject, that it is easy to construct a method for the complete
solution of the problem. That the method given is the best possible,
I certainly do not claim, but only that it is much better than I could
have found without the use of vector notations. Some of the more
obvious crudities in my paper have been corrected in that of Beebe and
Phillips.‡ Doubtless many more remain, even if the general method
be preserved.

My first efforts, however, to solve the fundamental approximative
equation were along the same lines which Fabritius has followed:
to set r₁ = r₂ and r₃ = r₂ in equation second of (2) of Fabritius, which
will give […], and then to get […] from the first of (3) of Fabritius,
and then r₃ either from equation second of (3) or from some other

* [In which the preceding memoir, and also that of Beebe and Phillips referred to
below, are translated.]

† [See Fabritius, W., “Ueber eine leichte Methode der Bahnbestimmung mit
Zugrundelegung des Princips von Gibbs,” also “Weitere Anwendungen des Gibbs’schen
Princips,” Astronomische Nachrichten, No. 3061, 3065 (1891).]

‡ [“The Orbit of Swift’s comet, 1880 V, Determined by Gibbs’s Vector Method,”
W. Beebe and A. W. Phillips. Gould’s Astronomical Journal, vol. ix, Dec. 1889.]
which would serve the purpose, and then to find better values of
ρ₂, r₂ by setting in equation second of (2)



the expressions in brackets denoting numbers derived from the
approximate values already found. This is similar to or identical with
the method of Fabritius, except that he combines with it the principle
of interpolation (for the first value in the third “hypothesis”). As
I found the approximation by this method sometimes slow or failing,
notably in the case of Swift’s comet, 1880 V, I tried the method
published in my paper. Indeed, it may be said that the method of
my paper was constructed to meet the exigencies of the case of the
comet, 1880 V.

In ordinary cases I think that the method of Fabritius may very
likely be better than that which I published. The equations are very
simply and perspicuously represented in vector notations. I shall use
the notations of my paper, writing E, F, etc., for German letters.* To
eliminate ρ₁ and ρ₃ from equation (7) in my paper, multiply directly
by 𝔉₁ × 𝔉₃. This gives

  A₁(1 + B₁/r₁³)(𝔈₁𝔉₁𝔉₃) − (1 − B₂/r₂³)[(𝔈₂𝔉₁𝔉₃) + ρ₂(𝔉₂𝔉₁𝔉₃)]
        + A₃(1 + B₃/r₃³)(𝔈₃𝔉₁𝔉₃) = 0.                              (a)

To eliminate ρ₃ and r₃, multiply by (𝔈₃ × 𝔉₃), which gives

  A₁(1 + B₁/r₁³)[(𝔈₁𝔈₃𝔉₃) + ρ₁(𝔉₁𝔈₃𝔉₃)]
        − (1 − B₂/r₂³)[(𝔈₂𝔈₃𝔉₃) + ρ₂(𝔉₂𝔈₃𝔉₃)] = 0.                  (b)

When we have found ρ₁, r₁, ρ₂, r₂ it is not necessary to eliminate
any of them, and to save labor in forming the equation for ρ₃, r₃, I
should be inclined to take the components in (7) in the direction of
one of the coordinate axes, choosing that one which is most nearly
directed towards the third observed position. However, I will write

  A₁(1 + B₁/r₁³)[(𝔈₁.𝔄) + ρ₁(𝔉₁.𝔄)] − (1 − B₂/r₂³)[(𝔈₂.𝔄) + ρ₂(𝔉₂.𝔄)]
        + A₃(1 + B₃/r₃³)[(𝔈₃.𝔄) + ρ₃(𝔉₃.𝔄)] = 0,                    (c)

where 𝔄 may represent an axis of coordinates, or (𝔈₁ × 𝔉₁), which
would give Fabritius’ equation. It might be directed towards the pole
of the ecliptic, which would make (𝔈₁.𝔄), (𝔈₂.𝔄), (𝔈₃.𝔄) vanish,
except for exceedingly minute quantities depending on the latitude of
the sun and the geocentric coordinates of the observatories, if these
are included in 𝔈₁, 𝔈₂, 𝔈₃.

* [In the remainder of the letter as here printed German capitals have been substituted
for the E, F, P, etc. of the original, thus making the notation uniform with that of
the paper referred to.]

The equations (a), (b), (c), which are together equivalent to (7), I
would solve as follows, almost in the same way as Fabritius, but
relying a little more on interpolation, and less on the convergence of
which he speaks, which in special cases may more or less fail.

Setting r₁ = r₂ and r₃ = r₂ in (a), which thus modified I shall call (a′),
and solving this (a′) by “trial and error,” using ρ₂ as the independent
variable, as soon as I have a value of ρ₂ which I think will give a
residual of (a′) of the same order of smallness as the effect of changing
1/r₁³ and 1/r₃³ into 1/r₂³, I determine from this value by (b) and (c)
ρ₁, r₁ and ρ₃, r₃, and then find the residual of (a), using the values of
r₁, r₂, r₃ derived all from the same assumed ρ₂. Now using the last
value of Δ(residual)/Δρ₂ in my previous calculations on (a′), which
indeed applies only roughly to the (a), I would get a value ρ₂ which I
would use for the second “hypothesis” in (a). This will give a second
residual in (a), which will enable me to make a more satisfactory
interpolation. As many more interpolations may be made as shall be
found necessary.
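
The interpolation described above, in which the ratio Δ(residual)/Δρ₂ from the last two trials supplies the next trial value, is in modern terms a secant iteration. The following is a minimal sketch of that idea only. The residual function below is a hypothetical stand-in (a simple cubic), not equation (a), and all names are illustrative.

```python
def interpolate_root(residual, x0, x1, tol=1e-12, max_iter=50):
    """Repeated linear interpolation on the two latest trial values."""
    f0, f1 = residual(x0), residual(x1)
    for _ in range(max_iter):
        if abs(f1) < tol or f1 == f0:
            break
        slope = (f1 - f0) / (x1 - x0)      # plays the part of d(residual)/d(rho2)
        x2 = x1 - f1 / slope               # next "hypothesis"
        x0, f0 = x1, f1
        x1, f1 = x2, residual(x2)
    return x1

# Hypothetical residual standing in for the residual of equation (a).
def toy_residual(rho):
    return rho**3 - 2.0 * rho - 5.0

root = interpolate_root(toy_residual, 2.0, 3.0)
print("trial value after interpolation:", root)
```

In the procedure of the letter each evaluation of the residual would itself require solving (b) and (c) for the other unknowns, so the number of interpolations is what governs the labor.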

Some such method, which should perhaps be called the method of 
Fabritius, would, I think, in most cases probably be the best for 
solution of equation (7). 

Of course I am quite aware that the merit of my paper, if any, lies
principally in the fundamental approximation (1). I will add a few
words on this subject.

The equation may be written more symmetrically



                                                                  I

It might be made entirely symmetrical by writing −τ₂ for τ₂.

If an expression ending with t³ had been used, we could still have
satisfied two of the conditions relating to acceleration, and should
have obtained



                                                                  IIa
or

                                                                  IIb
or

                                                                  IIc
Using an expansion ending with t² we can only satisfy one
condition relating to acceleration, say the second. This will give



                                                                  III

(Gauss uses virtually



which is a little more convenient, but not, I think, generally quite
so accurate.)

Writing an equation analogous to III for the earth and subtracting
from (7), Mem. Nat. Acad., we have



which gives, on multiplication by […] and […], theorems of
Olbers and Lambert.

It is evident that in general the error in I is of the fifth order, in
IIa, IIb, IIc of the fourth, and in III of the third. But for equal
intervals, the error in I is of the sixth order, and in III of the fourth.
And when […], IIa becomes identical with I, and its
error is of the fifth order.

The same is true of IIc in the corresponding case. It follows that
when the intervals are nearly as 5:8 we should use IIa or IIc instead
of I. This will evidently abbreviate the solution given above as
only one of the quantities is to be used.

The formulæ IIa, IIb, IIc may also be obtained by the following
method, which will show their relative accuracy.

The interpolation formula



has an error evidently of the second order. If we multiply by



and subtract from I, we get IIa. So if we multiply by



we get IIb or IIc. The errors due to using one of these equations
instead of I are therefore proportional to these multipliers and very
unequal.

Again, in case of equal intervals, IIa and IIc become identical with
III. There is, therefore, no reason for using IIa or IIc when the
intervals are nearly equal. IIb is in this case much less accurate
than III.
It will be observed that all the formulæ I, IIa, IIb, IIc, III, may be
expressed in the general form



except that the letters B₁, B₂, B₃ have different values in the different
cases, some vanishing in the more simple formulæ. Moreover, if the
values of B₁, B₂, B₃ have been calculated for I, the values for IIa, IIb,
or IIc are found simply by subtraction of one of the numbers from the
three. It is evident that IIb will hardly be useful except in special
cases, as in the determination of a parabolic orbit in the failing case of
Olbers’ method, and then it would be a question whether it would not
be better to determine the orbit from […], using IIa
or IIc.

Equations IIa and IIc are very appropriate for the determination
of an elliptic orbit when the observed motion is nearly in the ecliptic,
by means of four observations with intervals nearly in the ratio
5:8:5.

It is evident that the solution of (7) given above may be varied, in
ways too numerous to mention, by the use of the simpler forms IIa,
IIc, or III for I in the earlier stages of the work. This only involves
changing the values of B₁, B₂, B₃ in (a), (b) and (c).

It is not correct to say that in my expressions for the ratios of the
triangles the error is of the fifth order in general, or for equal intervals,
of the sixth. If we write P₁, P₂, P₃ for the coefficients of ℜ₁, ℜ₂, ℜ₃
in I, and 𝔛 for the error of the equation, we have exactly



which gives



Now P₁/P₂ is my expression for the ratio of the triangles, and […]
is its error. This is of the fourth order in general (since the
denominator is of the first), and for equal intervals, of the fifth. The
same is true of the two other ratios. Thus we have



Adding these equations and subtracting 1 [from both sides] we have



Here the last term, which represents the error, is of the fifth
order in general, or for equal intervals, of the sixth. But the
quantity sought is of the second order, and the relative error is of
the third order in the general case, or the fourth for equal intervals.
It is precisely this error which is most important in the case of
elliptic orbits.

It will be observed that the accuracy of the expressions for the
ratios [r₂r₃] : [r₁r₃] : [r₁r₂] affords no measure of the accuracy of the
formula for the determination of elliptic orbits.

I think that this hasty sketch will illustrate the convenience and
perspicuity of vector notations in this subject, quite independently of
any particular method which is chosen for the determination of the
orbit. What is the best method? is hardly, I think, a question which
admits of a definite reply. It certainly depends upon the ratio of the
time intervals, their absolute value, and many other things.

Yours very truly, 

J. Willard Gibbs. 

P.S. If we wish to use the curtate distances, with reference to the
ecliptic or the equator, let ρ be defined as the distance multiplied by
cosine (lat. or dec.), and 𝔉 as a vector of length secant (lat. or dec.).
For the most part the formulæ will require no change, but the square
of 𝔉 will be sec²(lat. or dec.) instead of unity, so that the last terms
of (8) will have this factor. […] Gauss’ (0.1.2.),
whereas in my paper (𝔉₁𝔉₂𝔉₃) is Lagrange’s […].

J. W. G.



VII. 


ON THE RÔLE OF QUATERNIONS IN THE ALGEBRA
OF VECTORS.

[Nature, vol. xliii. pp. 511-513, April 2, 1891.]

The following passage, which has recently come to my notice, in
the preface to the third edition of Prof. Tait’s Quaternions seems
to call for some reply:

“Even Prof. Willard Gibbs must be ranked as one of the retarders
of quaternion progress, in virtue of his pamphlet on Vector Analysis,
a sort of hermaphrodite monster, compounded of the notations of
Hamilton and of Grassmann.”

The merits or demerits of a pamphlet printed for private distribution
a good many years ago do not constitute a subject of any great
importance, but the assumptions implied in the sentence quoted are
suggestive of certain reflections and inquiries which are of broader
interest, and seem not untimely at a period when the methods and
results of the various forms of multiple algebra are attracting so much
attention. It seems to be assumed that a departure from quaternionic
usage in the treatment of vectors is an enormity. If this assumption
is true, it is an important truth; if not, it would be unfortunate if it
should remain unchallenged, especially when supported by so high an
authority. The criticism relates particularly to notations, but I
believe that there is a deeper question of notions underlying that of
notations. Indeed, if my offence had been solely in the matter of
notation, it would have been less accurate to describe my production
as a monstrosity, than to characterize its dress as uncouth.

Now what are the fundamental notions which are germane to a
vector analysis? (A vector analysis is of course an algebra for
vectors, or something which shall be to vectors what ordinary algebra
is to ordinary quantities.) If we pass over those notions which are so
simple that they go without saying, geometrical addition (denoted
by +) is, perhaps, first to be mentioned. Then comes the product of
the lengths of two vectors and the cosine of the angle which they
include. This, taken negatively, is denoted in quaternions by Sαβ,
where α and β are the vectors. Equally important is a vector
at right angles to α and β (on a specified side of their plane), and
representing in length the product of their lengths and the sine of
the angle which they include. This is denoted by Vαβ in quaternions.
How these notions are represented in my pamphlet is a question of
very subordinate consequence, which need not be considered at
present. The importance of these notions, and the importance of a
suitable notation for them, is not, I suppose, a matter on which there
is any difference of opinion. Another function of α and β, called
their product and written αβ, is used in quaternions. In the general
case, this is neither a vector, like Vαβ, nor a scalar (or ordinary
algebraic quantity), like Sαβ, but a quaternion; that is, it is part
vector and part scalar. It may be defined by the equation

                    αβ = Vαβ + Sαβ.
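
The defining equation can be illustrated numerically. The following is a minimal sketch, assuming the usual representation of a quaternion as a tuple (w, x, y, z) with Hamilton's multiplication rule, and of a vector as a quaternion with zero scalar part. The function names are illustrative only.

```python
def qmul(p, q):
    """Hamilton product of two quaternions given as (w, x, y, z)."""
    pw, px, py, pz = p
    qw, qx, qy, qz = q
    return (pw * qw - px * qx - py * qy - pz * qz,
            pw * qx + px * qw + py * qz - pz * qy,
            pw * qy - px * qz + py * qw + pz * qx,
            pw * qz + px * qy - py * qx + pz * qw)

def dot(a, b):
    return sum(x * y for x, y in zip(a, b))

def cross(a, b):
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])

alpha = (1, 2, 3)
beta = (4, -5, 6)

product = qmul((0,) + alpha, (0,) + beta)   # quaternionic product of two vectors
scalar_part = product[0]                    # S(alpha beta)
vector_part = product[1:]                   # V(alpha beta)

print("S =", scalar_part, " equals minus the scalar product:", -dot(alpha, beta))
print("V =", vector_part, " equals the vector product:", cross(alpha, beta))
```

The scalar part comes out as the negative of the product of the lengths and the cosine of the included angle, and the vector part as the vector described in the text.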

The question arises, whether the quaternionic product can claim a
prominent and fundamental place in a system of vector analysis. It
certainly does not hold any such place among the fundamental
geometrical conceptions as the geometrical sum, the scalar product,
or the vector product. The geometrical sum α+β represents the
third side of a triangle as determined by the sides α and β. Vαβ
represents in magnitude the area of the parallelogram determined by
the sides α and β, and in direction the normal to the plane of the
parallelogram. SγVαβ represents the volume of the parallelepiped
determined by the edges α, β, and γ. These conceptions are the very
foundations of geometry.

We may arrive at the same conclusion from a somewhat narrower
but very practical point of view. It will hardly be denied that
sines and cosines play the leading parts in trigonometry. Now the
notations Vαβ and Sαβ represent the sine and the cosine of the angle
included between α and β, combined in each case with certain other
simple notions. But the sine and cosine combined with these
auxiliary notions are incomparably more amenable to analytical
transformation than the simple sine and cosine of trigonometry,
exactly as numerical quantities combined (as in algebra) with the
notion of positive or negative quality are incomparably more amenable
to analytical transformation than the simple numerical quantities of
arithmetic.

I do not know of anything which can be urged in favor of the
quaternionic product of two vectors as a fundamental notion in
vector analysis, which does not appear trivial or artificial in
comparison with the above considerations. The same is true of the
quaternionic quotient, and of the quaternion in general.

How much more deeply rooted in the nature of things are the
functions Sαβ and Vαβ than any which depend on the definition
of a quaternion, will appear in a strong light if we try to extend
our formulæ to space of four or more dimensions. It will not be
claimed that the notions of quaternions will apply to such a space,
except indeed in such a limited and artificial manner as to rob them
of their value as a system of geometrical algebra. But vectors exist
in such a space, and there must be a vector analysis for such a space.
The notions of geometrical addition and the scalar product are
evidently applicable to such a space. As we cannot define the
direction of a vector in space of four or more dimensions by the
condition of perpendicularity to two given vectors, the definition of Vαβ,
as given above, will not apply totidem verbis to space of four or more
dimensions. But a little change in the definition, which would make
no essential difference in three dimensions, would enable us to apply
the idea at once to space of any number of dimensions.

These considerations are of a somewhat a priori nature. It may be
more convincing to consider the use actually made of the quaternion
as an instrument for the expression of spatial relations. The principal
use seems to be the derivation of the functions expressed by Sαβ and
Vαβ. Each of these expressions is regarded by quaternionic writers
as representing two distinct operations; first, the formation of the
product αβ, which is the quaternion, and then the taking out of this
quaternion the scalar or the vector part, as the case may be, this
second process being represented by the selective symbol, S or V.
This is, I suppose, the natural development of the subject in a treatise
on quaternions, where the chosen subject seems to require that we
should commence with the idea of a quaternion, or get there as soon
as possible, and then develop everything from that particular point of
view. In a system of vector analysis, in which the principle of
development is not thus predetermined, it seems to me contrary to good
method that the more simple and elementary notions should be defined
by means of those which are less so.

The quaternion affords a convenient notation for rotations. The
notation q( )q⁻¹, where q is a quaternion and the operand is to be
written in the parenthesis, produces on all possible vectors just such
changes as a (finite) rotation of a solid body. Rotations may also be
represented, in a manner which seems to leave nothing to be desired,
by linear vector functions. Doubtless each method has advantages in
certain cases, or for certain purposes. But since nothing is more
simple than the definition of a linear vector function, while the
definition of a quaternion is far from simple, and since in any case
linear vector functions must be treated in a system of vector analysis,
capacity for representing rotations does not seem to me sufficient to
entitle the quaternion to a place among the fundamental and necessary
notions of a vector analysis.
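
The two representations of a rotation can be compared directly. The following is a minimal sketch, assuming a unit quaternion q = (cos θ/2, n sin θ/2) for a rotation through the angle θ about the unit axis n, and the standard axis-angle matrix as the corresponding linear vector function. Neither formula is given in the text, and the function names are illustrative only.

```python
import math

def qmul(p, q):
    """Hamilton product of two quaternions given as (w, x, y, z)."""
    pw, px, py, pz = p
    qw, qx, qy, qz = q
    return (pw * qw - px * qx - py * qy - pz * qz,
            pw * qx + px * qw + py * qz - pz * qy,
            pw * qy - px * qz + py * qw + pz * qx,
            pw * qz + px * qy - py * qx + pz * qw)

def unit(axis):
    n = math.sqrt(sum(c * c for c in axis))
    return tuple(c / n for c in axis)

def rotate_by_quaternion(v, axis, angle):
    """Apply q( )q^-1 to the vector v."""
    half = angle / 2.0
    q = (math.cos(half),) + tuple(math.sin(half) * c for c in unit(axis))
    q_inv = (q[0], -q[1], -q[2], -q[3])      # inverse of a unit quaternion
    w, x, y, z = qmul(qmul(q, (0.0,) + tuple(v)), q_inv)
    return (x, y, z)

def rotation_matrix(axis, angle):
    """The same rotation as a linear vector function (3 x 3 array)."""
    x, y, z = unit(axis)
    c, s = math.cos(angle), math.sin(angle)
    t = 1.0 - c
    return [[t * x * x + c,     t * x * y - s * z, t * x * z + s * y],
            [t * x * y + s * z, t * y * y + c,     t * y * z - s * x],
            [t * x * z - s * y, t * y * z + s * x, t * z * z + c]]

def apply(matrix, v):
    return tuple(sum(row[j] * v[j] for j in range(3)) for row in matrix)

axis, angle, v = (1.0, 2.0, 2.0), 0.7, (0.3, -1.2, 2.5)
by_quaternion = rotate_by_quaternion(v, axis, angle)
by_matrix = apply(rotation_matrix(axis, angle), v)
print(by_quaternion)
print(by_matrix)
```

Both routes give the same rotated vector to within rounding, which is the sense in which either notation may serve.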

Another use of the quaternionic idea is associated with the symbol ∇.
The quantities written S∇ω and V∇ω, where ω denotes a vector having
values which vary in space, are of fundamental importance in physics.
In quaternions these are derived from the quaternion ∇ω by selecting
respectively the scalar or the vector part. But the most simple and
elementary definitions of S∇ω and V∇ω are quite independent of the
conception of a quaternion, and the quaternion ∇ω is scarcely used
except in combination with the symbols S and V, expressed or
implied. There are a few formulæ in which there is a trifling gain in
compactness in the use of the quaternion, but the gain is very trifling
so far as I have observed, and generally, it seems to me, at the expense
of perspicuity.

These considerations are sufficient, I think, to show that the position 
of the quaternionist is not the only one from which the subject of 
vector analysis may be viewed, and that a method which would be 
monstrous from one point of view, may be normal and inevitable from 
another. 

Let us now pass to the subject of notations. I do not know
wherein the notations of my pamphlet have any special resemblance
to Grassmann’s, although the point of view from which the pamphlet
was written is certainly much nearer to his than to Hamilton’s. But
this is a matter of minor consequence. It is more important to ask,
What are the requisites of a good notation for the purposes of vector
analysis? There is no difference of opinion about the representation
of geometrical addition. When we come to functions having an
analogy to multiplication, the product of the lengths of two vectors
and the cosine of the angle which they include, from any point of
view except that of the quaternionist, seems more simple than the
same quantity taken negatively. Therefore we want a notation for
what is expressed by −Sαβ, rather than Sαβ, in quaternions. Shall
the symbol denoting this function be a letter or some other sign?
and shall it precede the vectors or be placed between them? A little
reflection will show, I think, that while we must often have recourse
to letters to supplement the number of signs available for the
expression of all kinds of operations, it is better that the symbols
expressing the most fundamental and frequently recurring operations
should not be letters, and that a sign between the vectors, and, as it
were, uniting them, is better than a sign before them in a case having
a formal analogy with multiplication. The case may be compared
with that of addition, for which α+β is evidently more convenient
than Σ(α, β) or Σαβ would be. Similar considerations will apply to
the function written in quaternions Vαβ. It would seem that we
obtain the ne plus ultra of simplicity and convenience, if we express
the two functions by uniting the vectors in each case with a sign
suggestive of multiplication. The particular forms of the signs which
we adopt is a matter of minor consequence. In order to keep within
the resources of an ordinary printing office, I have used a dot and a
cross, which are already associated with multiplication, but are not
needed for ordinary multiplication, which is best denoted by the
simple juxtaposition of the factors. I have no especial predilection
for these particular signs. The use of the dot is indeed liable to the
objection that it interferes with its use as a separatrix, or instead of a
parenthesis.

If, then, I have written α.β and α×β for what is expressed in
quaternions by −Sαβ and Vαβ, and in like manner ∇.ω and ∇×ω
for −S∇ω and V∇ω in quaternions, it is because the natural
development of a vector analysis seemed to lead logically to some such
notations. But I think that I can show that these notations have
some substantial advantages over the quaternionic in point of
convenience.

Any linear vector function of a variable vector ρ may be expressed
in the form

        αλ.ρ + βμ.ρ + γν.ρ = (αλ + βμ + γν).ρ = Φ.ρ,
where   Φ = αλ + βμ + γν;

or in quaternions

        −αSλρ − βSμρ − γSνρ = −(αSλ + βSμ + γSν)ρ = −φρ,

where   φ = αSλ + βSμ + γSν.

If we take the scalar product of the vector Φ.ρ, and another vector
σ, we obtain the scalar quantity

        σ.Φ.ρ = σ.(αλ + βμ + γν).ρ,
or in quaternions     Sσφρ = Sσ(αSλ + βSμ + γSν)ρ.

This is a function of σ and of ρ, and it is exactly the same kind of
function of σ that it is of ρ, a symmetry which is not so clearly
exhibited in the quaternionic notation as in the other. Moreover, we
can write σ.Φ for σ.(αλ + βμ + γν). This represents a vector which
is a function of σ, viz., the function conjugate to Φ.σ; and σ.Φ.ρ
may be regarded as the product of this vector and ρ. This is not
so clearly indicated in the quaternionic notation, where it would be
straining things a little to call Sσφ a vector.

The combinations $\alpha\lambda$, $\beta\mu$, etc., used above, are distributive with 
regard to each of the two vectors, and may be regarded as a kind of 
product. If we wish to express everything in terms of $i$, $j$, and $k$, $\Phi$ 
will appear as a sum of $ii$, $ij$, $ik$, $ji$, $jj$, $jk$, $ki$, $kj$, $kk$, each with a 
numerical coefficient. These nine coefficients may be arranged in a 
square, and constitute a matrix; and the study of the properties of 
expressions like $\Phi$ is identical with the study of ternary matrices. 
This expression of the matrix as a sum of products (which may be 
extended to matrices of any order) affords a point of departure from 
which the properties of matrices may be deduced with the utmost 
facility. The ordinary matricular product is expressed by a dot, as 
$\Phi.\Psi$. Other important kinds of multiplication may be defined by 
the equations 

$$(\alpha\lambda)\,{}^{\times}_{\times}\,(\beta\mu) = (\alpha\times\beta)(\lambda\times\mu), \qquad (\alpha\lambda):(\beta\mu) = (\alpha.\beta)(\lambda.\mu).$$

With these definitions $\tfrac{1}{6}\Phi\,{}^{\times}_{\times}\,\Phi:\Phi$ will be the determinant of $\Phi$, and 
$\Phi\,{}^{\times}_{\times}\,\Phi$ will be the conjugate of the reciprocal of $\Phi$ multiplied by twice 
the determinant. If $\Phi$ represents the manner in which vectors are 
affected by a strain, $\tfrac{1}{2}\Phi\,{}^{\times}_{\times}\,\Phi$ will represent the manner in which 
surfaces are affected, and $\tfrac{1}{6}\Phi\,{}^{\times}_{\times}\,\Phi:\Phi$ the manner in which volumes 
are affected. Considerations of this kind do not attach themselves so 
naturally to the notation $\phi = \alpha S\lambda + \beta S\mu + \gamma S\nu$, nor does the subject 
admit so free a development with this notation, principally because 
the symbol $S$ refers to a special use of the matrix, and is very much 
in the way when we want to apply the matrix to other uses, or to 
subject it to various operations. 
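
As a modern illustration (not part of the original paper), the statements about the determinant and about the effect of a strain on surfaces can be checked numerically. The sketch below assumes that a dyadic is stored as a list of (antecedent, consequent) pairs of 3-tuples; the function names and the sample vectors are hypothetical choices made for this example only.

```python
from itertools import product


def dot(u, v):
    return sum(a * b for a, b in zip(u, v))


def cross(u, v):
    return (u[1] * v[2] - u[2] * v[1],
            u[2] * v[0] - u[0] * v[2],
            u[0] * v[1] - u[1] * v[0])


def as_matrix(dyads):
    """Nine coefficients of a sum of dyads (antecedent, consequent)."""
    return [[sum(a[i] * l[j] for a, l in dyads) for j in range(3)]
            for i in range(3)]


def apply(matrix, v):
    """The vector Phi.v for a dyadic given by its matrix."""
    return tuple(dot(row, v) for row in matrix)


def det3(matrix):
    """Determinant as the scalar triple product of the rows."""
    return dot(matrix[0], cross(matrix[1], matrix[2]))


def double_cross(P, Q):
    """(a l) xx (b m) = (a x b)(l x m), extended by distributivity."""
    return [(cross(a, b), cross(l, m)) for (a, l), (b, m) in product(P, Q)]


def double_dot(P, Q):
    """(a l):(b m) = (a.b)(l.m), extended by distributivity."""
    return sum(dot(a, b) * dot(l, m) for (a, l), (b, m) in product(P, Q))


# Hypothetical sample values standing in for alpha, lambda, beta, mu, ...
phi = [((1, 2, 0), (0, 1, 3)),
       ((2, -1, 1), (1, 0, 1)),
       ((0, 3, 4), (2, 1, -1))]

M = as_matrix(phi)
phi_xx_phi = double_cross(phi, phi)
six_det = double_dot(phi_xx_phi, phi)   # Phi xx Phi : Phi
surface_map = as_matrix(phi_xx_phi)     # matrix of Phi xx Phi

u, v = (1, 0, 2), (3, -1, 1)            # two arbitrary vectors
```

Under these assumptions `six_det` equals six times `det3(M)`, and `surface_map` applied to `cross(u, v)` equals twice the cross product of `apply(M, u)` and `apply(M, v)`, which is the sense in which half the double-cross product describes how surfaces are affected.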



VIII. 


QUATERNIONS AND THE AUSDEHNUNGSLEHRE. 

[Nature, vol. XLIV. pp. 79-82, May 28, 1891.] 

The year 1844 is memorable in the annals of mathematics on account 
of the first appearance on the printed page of Hamilton’s Quaternions 
and Grassmann’s Ausdehnungslehre. The former appeared in the 
July, October, and supplementary numbers of the Philosophical 
Magazine, after a previous communication to the Royal Irish Academy, 
November 13, 1843. This communication was indeed announced to 
the Council of the Academy four weeks earlier, on the very day of 
Hamilton’s discovery of quaternions, as we learn from one of his 
letters. The author of the Ausdehnungslehre, although not unconscious 
of the value of his ideas, seems to have been in no haste to 
place himself on record, and published nothing until he was able 
to give the world the most characteristic and fundamental part of 
his system with considerable development in a treatise of more than 
300 pages, which appeared in August 1844. 

The doctrine of quaternions has won a conspicuous place among the 
various branches of mathematics, but the nature and scope of the 
Ausdehnungslehre, and its relation to quaternions, seem to be still 
the subject of serious misapprehension in quarters where we naturally 
look for accurate information. Historical justice, and the interests of 
mathematical science, seem to require that the allusions to the 
Ausdehnungslehre in the article on “Quaternions” in the last edition of the 
Encyclopædia Britannica, and in the third edition of Prof. Tait’s 
Treatise on Quaternions, should not be allowed to pass without protest. 

It is principally as systems of geometrical algebra that quaternions 
and the Ausdehnungslehre come into comparison. To appreciate the 
relations of the two systems, I do not see how we can proceed better 
than if we ask first what they have in common, then what either 
system possesses which is peculiar to itself. The relative extent and 
importance of the three fields, that which is common to the two 
systems, and those which are peculiar to each, will determine the 
relative rank of the geometrical algebras. Questions of priority can 
only relate to the field common to both, and will be much simplified 
by having the limits of that field clearly drawn. 

Geometrical addition in three dimensions is common to the two 
systems, and seems to have been discovered independently both by 
Hamilton and Grassmann, as well as by several other persons about 
the same time. It is not probable that any especial claim for priority 
with respect to this principle will be urged for either of the two with 
which we are now concerned. 

The functions of two vectors which are represented in quaternions 
by $S\alpha\beta$ and $V\alpha\beta$ are common to both systems as published in 1844, 
but the quaternion is peculiar to Hamilton’s. The linear vector 
function is common to both systems as ultimately developed, although 
mentioned only by Grassmann as early as 1844. 

To those already acquainted with quaternions, the first question will 
naturally be: To what extent are the geometrical methods which are 
usually called quaternionic peculiar to Hamilton, and to what extent 
are they common to Grassmann? This is a question which anyone 
can easily decide for himself. It is only necessary to run one’s eye 
over the equations used by quaternionic writers in the discussion of 
geometrical or physical subjects, and see how far they necessarily 
involve the idea of the quaternion, and how far they would be 
intelligible to one understanding the functions $S\alpha\beta$ and $V\alpha\beta$, but having 
no conception of the quaternion, or at least could be made so by 
trifling changes of notation, as by writing S or V in places where 
they would not affect the value of the expressions. For such a test 
the examples and illustrations in treatises on quaternions would be 
manifestly inappropriate, so far as they are chosen to illustrate 
quaternionic principles, since the object may influence the form of 
presentation. But we may use any discussion of geometrical or 
physical subjects, where the writer is free to choose the form most 
suitable to the subject. I myself have used the chapters and sections 
in Prof. Tait’s Quaternions on the following subjects: Geometry of 
the straight line and plane, the sphere and cyclic cone, surfaces of 
the second degree, geometry of curves and surfaces, kinematics, statics 
and kinetics of a rigid system, special kinetic problems, geometrical 
and physical optics, electrodynamics, general expressions for the action 
between linear elements, application of $\nabla$ to certain physical analogies, 
pp. 160-371, except the examples (not worked out) at the close of the 
chapters. 

Such an examination will show that for the most part the methods 
of representing spatial relations used by quaternionic writers are 
common to the systems of Hamilton and Grassmann. To an extent 
comparatively limited, cases will be found in which the quaternionic 
idea forms an essential element in the signification of the equations. 

The question will then arise with respect to the comparatively 
limited field which is the peculiar property of Hamilton, How 
important are the advantages to be gained by the use of the quaternion? 
This question, unlike the preceding, is one into which a personal 
equation will necessarily enter. Everyone will naturally prefer the 
methods with which he is most familiar; but I think that it may be 
safely affirmed that in the majority of cases in this field the advantage 
derived from the use of the quaternion is either doubtful or very 
trifling. There remains a residuum of cases in which a substantial 
advantage is gained by the use of the quaternionic method. Such 
cases, however, so far as my own observation and experience extend, 
are very exceptional. If a more extended and careful inquiry should 
show that they are ten times as numerous as I have found them, they 
would still be exceptional. 

We have now to inquire what we find in the Ausdehnungslehre, in 
the way of a geometrical algebra, that is wanting in quaternions. In 
addition to an algebra of vectors, the Ausdehnungslehre affords a 
system of geometrical algebra in which the point is the fundamental 
element, and which for convenience I shall call Grassmann’s algebra 
of points. In this algebra we have first the addition of points, or 
quantities located at points, which may be explained as follows. The 
equation 

$$aA + bB + cC + \text{etc.} = eE + fF + \text{etc.},$$

in which the capitals denote points, and the small letters scalars (or 
ordinary algebraic quantities), signifies that 

$$a + b + c + \text{etc.} = e + f + \text{etc.},$$

and also that the centre of gravity of the weights a, b, c, etc., at the 
points A, B, C, etc., is the same as that of the weights e, f, etc., at 
the points E, F, etc. (It will be understood that negative weights are 
allowed as well as positive.) The equation is thus equivalent to four 
equations of ordinary algebra. In this Grassmann was anticipated by 
Möbius (Barycentrischer Calcul, 1827). 
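
The count of four ordinary equations can be made concrete with a small sketch (a modern illustration, not part of the original paper). It assumes a weighted point is recorded by its weight together with the three weighted coordinates; the function names and the sample points are hypothetical.

```python
from fractions import Fraction


def point_sum(terms):
    """Add weighted points; each term is (weight, (x, y, z)).

    Returns (total weight, sum of w*x, sum of w*y, sum of w*z),
    i.e. the four numbers that an equation between point sums compares.
    """
    total = [0, 0, 0, 0]
    for w, p in terms:
        total[0] += w
        for k in range(3):
            total[k + 1] += w * p[k]
    return tuple(total)


def centre_of_gravity(total):
    """Centre of gravity of a sum whose total weight is not zero."""
    w = total[0]
    return tuple(Fraction(c, w) for c in total[1:])


# Hypothetical sample points with integer coordinates.
A, B, C = (0, 0, 0), (4, 0, 0), (0, 4, 0)
E, F = (2, 0, 0), (0, 4, 0)

left = point_sum([(1, A), (1, B), (2, C)])   # 1A + 1B + 2C
right = point_sum([(2, E), (2, F)])          # 2E + 2F
```

Here `left` and `right` agree in all four components, so the two sides have the same total weight and the same centre of gravity. A sum whose total weight is zero has no centre of gravity, and `centre_of_gravity` is not meant for that case.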

We have next the addition of finite straight lines, or quantities 
located in straight lines (Liniengrössen). The meaning of the 
equation 

$$AB + CD + \text{etc.} = EF + GH + \text{etc.}$$

will perhaps be understood most readily, if we suppose that each 
member represents a system of forces acting on a rigid body. The 
equation then signifies that the two systems are equivalent. An 
equation of this form is therefore equivalent to six ordinary equations. 
It will be observed that the Liniengrössen AB and CD are not simply 
vectors; they have not merely length and direction, but they are also 
located each in a given line, although their position within those 
lines is immaterial. In Clifford’s terminology, AB is a rotor, AB + CD 
a motor. In the language of Prof. Ball’s Theory of Screws, AB + CD 
represents either a twist or a wrench. 
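
One way to see the six ordinary equations (a modern sketch, not taken from the original paper) is to record each line-bound quantity AB by the vector B - A together with its moment about the origin, A x B, as for a force acting along the line. The representation, the function names, and the sample points below are assumptions made for illustration.

```python
def sub(u, v):
    return tuple(a - b for a, b in zip(u, v))


def cross(u, v):
    return (u[1] * v[2] - u[2] * v[1],
            u[2] * v[0] - u[0] * v[2],
            u[0] * v[1] - u[1] * v[0])


def line_bound(a, b):
    """Six coordinates of AB: the vector B - A, then the moment A x B."""
    return sub(b, a) + cross(a, b)


def add(*terms):
    """Componentwise sum of six-coordinate quantities."""
    return tuple(sum(col) for col in zip(*terms))


# Hypothetical sample points.
A, B = (1, 0, 0), (1, 2, 0)
AB = line_bound(A, B)
slid = line_bound((1, 3, 0), (1, 5, 0))      # moved along the same line
shifted = line_bound((2, 0, 0), (2, 2, 0))   # same vector, parallel line

C = (4, 0, 1)
D = (4, 2, 1)                                # D = B + C - A
combined = add(AB, line_bound(A, C))         # AB + AC
```

In this representation `slid` equals `AB`, so position within the line is immaterial, while `shifted` has the same vector part but a different moment. The sum `combined` equals `line_bound(A, D)`, which is the parallelogram of forces for two forces applied at A.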

We have next the addition of plane surfaces (Plangrössen). The 
equation 

$$ABC + DEF + GHI = JKL$$

signifies that the plane JKL passes through the point common to the 
planes ABC, DEF, and GHI, and that the projection by parallel lines 
of the triangle JKL on any plane is equal to the sum of the 
projections of ABC, DEF, and GHI on the same plane, the areas being 
taken positively or negatively according to the cyclic order of the 
projected points. This makes the equation equivalent to four ordinary 
equations. 

Finally, we have the addition of volumes, as in the equation 

ABCD + EFGH = IJKL, 

where there is nothing peculiar, except that each term represents the 
six-fold volume of the tetrahedron, and is to be taken positively or 
negatively according to the relative position of the points. 

We have also multiplications as follows: The line (Liniengrösse) 
AB is regarded as the product of the points A and B. The Plangrösse 
ABC, which represents the double area of the triangle, is regarded as 
the product of the three points A, B, and C, or as the product of the 
line AB and the point C, or of BC and A, or indeed of BA and C. 
The volume ABCD, which represents six times the tetrahedron, is 
regarded as the product of the points A, B, C, and D, or as the 
product of the point A and the Plangrösse BCD, or as the product of 
the lines AB and CD, etc., etc. 
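
The statement that ABCD represents six times the tetrahedron can be illustrated with coordinates (a modern sketch, not part of the original paper). It assumes each point is written as a row (1, x, y, z), so that the product of four points is a 4 x 4 determinant; the function names and sample points are hypothetical.

```python
def det(m):
    """Determinant by cofactor expansion along the first row."""
    if len(m) == 1:
        return m[0][0]
    total = 0
    for j, entry in enumerate(m[0]):
        minor = [row[:j] + row[j + 1:] for row in m[1:]]
        total += (-1) ** j * entry * det(minor)
    return total


def volume_product(a, b, c, d):
    """Product ABCD as the determinant of the rows (1, x, y, z)."""
    return det([(1,) + tuple(p) for p in (a, b, c, d)])


def six_times_volume(a, b, c, d):
    """Six times the signed tetrahedron volume, from the edges at a."""
    rows = [tuple(q - p for p, q in zip(a, other)) for other in (b, c, d)]
    return det(rows)


# Hypothetical sample points: a unit corner tetrahedron and a general one.
A, B, C, D = (0, 0, 0), (1, 0, 0), (0, 1, 0), (0, 0, 1)
P, Q, R, S = (1, 2, 3), (4, 0, 1), (2, 5, -1), (0, 1, 7)
```

For the unit corner tetrahedron, whose volume is one sixth, `volume_product(A, B, C, D)` is 1, and interchanging two of the points reverses the sign, in keeping with the rule that the volume is taken positively or negatively according to the relative position of the points.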

This does not exhaust the wealth of multiplicative relations which 
Grassmann has found in the very elements of geometry. The 
following products are called regressive, as distinguished from the 
progressive, which have been described. The product of the 
Plangrössen ABC and DEF is a part of the line in which the planes ABC 
and DEF intersect, which is equal in numerical value to the product 
of the double areas of the triangles ABC and DEF multiplied by the 
sine of the angle made by the planes. The product of the 
Liniengrösse AB and the Plangrösse CDE is the point of intersection of the 
line and the plane with a numerical coefficient representing the 
product of the length of the line and the double area of the triangle 
multiplied by the sine of the angle made by the line and the plane. 
The product of three Plangrössen is consequently the point common 
to the three planes with a certain numerical coefficient. In plane 
geometry we have a regressive product of two Liniengrössen, which 
gives the point of intersection of the lines with a certain numerical 
coefficient. 

The fundamental operations relating to the point, line, and plane 
are thus translated into analysis by multiplications. The immense 
flexibility and power of such an analysis will be appreciated by 
anyone who considers what generalized multiplication in connection 
with additive relations has done in other fields, as in quaternions, or 
in the theory of matrices, or in the algebra of logic. For a single 
example, if we multiply the equation 

$$AB + CD + \text{etc.} = EF + GH + \text{etc.}$$
by PQ (P and Q being any two points), we have 

$$ABPQ + CDPQ + \text{etc.} = EFPQ + GHPQ + \text{etc.},$$
which will be recognised as expressing an important theorem of 
statics. 
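
A numerical check of this multiplication (a modern sketch, not part of the original paper) reads each term ABPQ as six times a signed tetrahedron volume, which is proportional to the moment of the force AB about the axis PQ. The two equivalent force systems and the points P and Q below are hypothetical sample values.

```python
def sub(u, v):
    return tuple(a - b for a, b in zip(u, v))


def dot(u, v):
    return sum(a * b for a, b in zip(u, v))


def cross(u, v):
    return (u[1] * v[2] - u[2] * v[1],
            u[2] * v[0] - u[0] * v[2],
            u[0] * v[1] - u[1] * v[0])


def volume_product(a, b, p, q):
    """ABPQ: six times the signed volume of the tetrahedron ABPQ."""
    return dot(sub(b, a), cross(sub(p, a), sub(q, a)))


# Left system: a force slid along its own line, plus a force AC.
A1, B1 = (1, 3, 0), (1, 5, 0)      # lies in the line x = 1, z = 0
A, C = (1, 0, 0), (4, 0, 1)
# Right system: the single resultant AD, by the parallelogram of forces
# applied to the forces (1,0,0)->(1,2,0) and AC acting at A.
D = (4, 2, 1)

P, Q = (0, 1, 5), (2, -3, 1)       # any two points defining an axis

left = volume_product(A1, B1, P, Q) + volume_product(A, C, P, Q)
right = volume_product(A, D, P, Q)
```

With these values `left` and `right` agree, which is what one expects if equivalent systems of forces have equal total moments about an arbitrary axis.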

The field in which Grassmann’s algebra of points, as distinguished 
from his algebra of vectors, finds its especial application and utility 
is nearly coincident with that in which, when we use the methods 
of ordinary algebra, tetrahedral or anharmonic coordinates are more 
appropriate than rectilinear. In fact, Grassmann's algebra of points 
may be regarded as the application of the methods of multiple algebra 
to the notions connected with tetrahedral coordinates, just as his or 
Hamilton’s algebra of vectors may be regarded as the application of 
the methods of multiple algebra to the notions connected with 
rectilinear coordinates. These methods, however, enrich the field to 
which they are applied with new notions. Thus the notion of the 
coordinates of a line in space, subsequently introduced by Plücker, 
was first given in the Ausdehnungslehre of 1844. It should also be 
observed that the utility of a multiple algebra when it takes the place 
of an ordinary algebra of four coordinates, is very much greater 
than when it takes the place of three coordinates, for the same reason 
that a multiple algebra taking the place of three coordinates is very 
much more useful than one taking the place of two. Grassmann’s 
algebra of points will always command the admiration of geometers 
and analysts, and furnishes an instrument of marvellous power to the 
former, and in its general form, as applicable to space of any number 
of dimensions, to the latter. To the physicist an algebra of points is 
by no means so indispensable an instrument as an algebra of vectors. 

Grassmann’s algebra of vectors, which we have described as 
coincident with a part of Hamilton’s system, is not really anything 
separate from his algebra of points, but constitutes a part of it, the 
vector arising when one point is subtracted from another. Yet it 
constitutes a whole, complete in itself, and we may separate it from 
the larger system to facilitate comparison with the methods of 
Hamilton. 

We have, then, as geometrical algebras published in 1844, an 
algebra of vectors common to Hamilton and Grassmann, augmented 
on Hamilton’s side by the quaternion, and on Grassmann’s by 
his algebra of points. This statement should be made with the 
reservation that the addition both of vectors and of points had 
been given by earlier writers. 

In both systems as finally developed we have the linear vector 
function, the theory of which is identical with that of strains and 
rotations. In Hamilton’s system we have also the linear quaternion 
function, and in Grassmann’s the linear function applied to the 
quantities of his algebra of points. This application gives those 
transformations in which projective properties are preserved, the 
doctrine of reciprocal figures or principle of duality, etc. 
(Grassmann’s theory of the linear function is, indeed, broader than this, 
being coextensive with the theory of matrices; but we are here 
considering only the geometrical side of the theory.) 

In his earliest writings on quaternions, Hamilton does not discuss 
the linear function. In his Lectures on Quaternions (1853), he treats 
of the inversion of the linear vector function, as also of the linear 
quaternion function, and shows how to find the latent roots of the 
vector function, with the corresponding axes for the case of real and 
unequal roots. He also gives a remarkable equation, the symbolic 
cubic, which the functional symbol must satisfy. This equation is 
a particular case of that which is given in Prof. Cayley’s classical 
Memoir on the Theory of Matrices (1858), and which is called by 
Prof. Sylvester the Hamilton-Cayley equation. In his Elements of 
Quaternions (1866), Hamilton extends the symbolic equation to the 
quaternion function. 

In Grassmann, although the linear function is mentioned in the 
first Ausdehnungslehre, we do not find so full a discussion of the 
subject until the second Ausdehnungslehre (1862), where he discusses 
the latent roots and axes, or what corresponds to axes in the general 
theory, the whole discussion relating to matrices of any order. The 
more difficult cases are included, as that of a strain in which all 
the roots are real, but there is only one axis or unchanged direction. 
On the formal side he shows how a linear function may be 
represented by a quotient or sum of quotients, and by a sum of products, 
Lückenausdruck. 

More important, perhaps, than the question when this or that 
theorem was first published is the question where we first find those 
notions and notations which give the key to the algebra of linear 
functions, or the algebra of matrices, as it is now generally called. 
In vol. xxxi, p. 35, of Nature, Prof. Sylvester speaks of Cayley’s 
“ever-memorable” Memoir on Matrices as constituting “a second 
birth of Algebra, its avatar in a new and glorified form,” and refers 
to a passage in his Lectures on Universal Algebra, from which, I 
think, we are justified in inferring that this characterization of the 
memoir is largely due to the fact that it is there shown how matrices 
may be treated as extensive quantities, capable of addition as well as 
of multiplication. This idea, however, is older than the memoir of 
1858. The Lückenausdruck, by which the matrix is expressed as a 
sum of a kind of products (lückenhaltig, or open), is described in a note 
at the end of the first Ausdehnungslehre. There we have the matrix 
given not only as a sum, but as a sum of products, introducing a 
multiplicative relation entirely different from the ordinary 
multiplication of matrices, and hardly less fruitful, but not lying nearly so 
near the surface as the relations to which Prof. Sylvester refers. The 
key to the theory of matrices is certainly given in the first 
Ausdehnungslehre, and if we call the birth of matricular analysis the second 
birth of algebra, we can give no later date to this event than the 
memorable year of 1844. 

The immediate occasion of this communication is the following 
passage in the preface to the third edition of Prof. Tait’s Quaternions: 

“Hamilton not only published his theory complete, the year before 
the first (and extremely imperfect) sketch of the Ausdehnungslehre 
appeared; but had given ten years before, in his protracted study of 
Sets, the very processes of external and internal multiplication 
(corresponding to the Vector and Scalar parts of a product of two vectors) 
which have been put forward as specially the property of Grassmann.” 

For additional information we are referred to art. “ Quaternions,” 
Encyc. Brit., where we read respecting the first Ausdehnungslehre: 

“In particular two species of multiplication (‘inner’ and ‘outer’) of 
directed lines in one plane were given. The results of these two 
kinds of multiplication correspond respectively to the numerical and 
the directed parts of Hamilton’s quaternion product. But Grassmann 
distinctly states in his preface that he had not had leisure to extend 
his method to angles in space. . . . But his claims, however great 
they may be, can in no way conflict with those of Hamilton, whose 
mode of multiplying couples (in which the ‘inner’ and ‘outer’ 
multiplication are essentially involved) was produced in 1833, and whose 
quaternion system was completed and published before Grassmann had 
elaborated for press even the rudimentary portions of his own system, 
in which the veritable difficulty of the whole subject, the application 
to angles in space, had not even been attacked.” 

I shall leave the reader to judge of the accuracy of the general 
terms used in these passages in comparing the first Ausdehnungslehre 
with Hamilton’s system as published in 1843 or 1844. The specific 
statements respecting Hamilton and Grassmann require an answer. 

It must be Hamilton’s Theory of Conjugate Functions or Algebraic 
Couples (read to the Royal Irish Academy, 1833 and 1835, and 
published in vol. xvii of the Transactions) to which reference is made 
in the statements concerning his “protracted study of Sets” and 
“mode of multiplying couples.” But I cannot find anything like 
Grassmann’s external or internal multiplication in this memoir, which 
is concerned, as the title pretty clearly indicates, with the theory of 
the complex quantities of ordinary algebra. 

It is difficult to understand the statements respecting the 
Ausdehnungslehre, which seem to imply that Grassmann’s two kinds of 
multiplication were subject to some kind of limitation to a plane. 
The external product is not limited in the first Ausdehnungslehre even 
to three dimensions. The internal, which is a comparatively simple 
matter, is mentioned in the first Ausdehnungslehre only in the preface, 
where it is defined, and placed beside the external product as relating 
to directed lines. There is not the least suggestion of any difference 
in the products in respect to the generality of their application to 
vectors. 

The misunderstanding seems to have arisen from the following 
sentence in Grassmann’s preface: “ And in general, in the consideration 
of angles in space, difficulties present themselves, for the complete 
(allseitig) solution of which I have not yet had sufficient leisure.” 
It is not surprising that Grassmann should have required more time 
for the development of some parts of his system, when we consider 
that Hamilton, on his discovery of quaternions, estimated the time 
which he should wish to devote to them at ten or fifteen years (see 
his letter to Prof. Tait in the North British Review for September 
1866), and actually took several years to prepare for the press as 
many pages as Grassmann had printed in 1844. But any speculation 
as to the questions which Grassmann may have had principally in 
mind in the sentence quoted, and the particular nature of the 
difficulties which he found in them, however interesting from other points 
of view, seems a very precarious foundation for a comparison of the 
systems of Hamilton and Grassmann as published in the years 
1843-44. Such a comparison should be based on the positive evidence 
of doctrines and methods actually published. 

Such a comparison I have endeavoured to make, or rather to 
indicate the basis on which it may be made, so far as systems of 
geometrical algebra are concerned. As a contribution to analysis in 
general, I suppose that there is no question that Grassmann’s system 
is of indefinitely greater extension, having no limitation to any 
particular number of dimensions. 



QUATERNIONS AND THE ALGEBRA OF VECTORS. 

[Nature, vol. XLVII. pp. 463, 464, Mar. 16, 1893.] 

In a recent number of Nature [vol. xlvii, p. 151], Mr. McAulay 
puts certain questions to Mr. Heaviside and to me, relating to a 
subject of such importance as to justify an answer somewhat at 
length. I cannot of course speak for Mr. Heaviside, although I 
suppose that his views are not very different from mine on the most 
essential points, but even if he shall have already replied before this 
letter can appear, I shall be glad to add whatever of force may belong 
to independent testimony. 

Mr. McAulay asks: “What is the first duty of the physical vector 
analyst quâ physical vector analyst?” The answer is not doubtful. 
It is to present the subject in such a form as to be most easily 
acquired, and most useful when acquired. 

In regard to the slow progress of such methods towards recognition 
and use by physicists and others, which Mr. McAulay deplores, it does 
not seem possible to impute it to any want of uniformity of notation. 
I doubt whether there is any modern branch of mathematics which 
has been presented for so long a time with a greater uniformity of 
notation than quaternions. 

What, then, is the cause of the fact which Mr. McAulay and all 
of us deplore ? It is not far to seek. We need only a glance at the 
volumes in which Hamilton set forth his method. No wonder that 
physicists and others failed to perceive the possibilities of simplicity, 
perspicuity, and brevity which were contained in a system presented 
to them in ponderous volumes of 800 pages. Perhaps Hamilton may 
have intended these volumes as a sort of thesaurus, and we should 
look to his shorter papers for a compact account of his method. But 
if we turn to his earlier papers on Quaternions in the Philosophical 
Magazine, in which principally he introduced the subject to the 
notice of his contemporaries, we find them entitled “On Quaternions; 
or on a New System of Imaginaries in Algebra,” and in them we 
find a great deal about imaginaries, and very little of a vector 
analysis. To show how slowly the system of vector analysis 
developed itself in the quaternionic nidus, we need only say that 
the symbols $S$, $V$, and $\nabla$ do not appear until two or three years 
after the discovery of quaternions. In short, it seems to have been 
only a secondary object with Hamilton to express the geometrical 
relations of vectors,—secondary in time, and also secondary in this, 
that it was never allowed to give shape to his work. 

But this relates to the past. In regard to the present status, I beg 
leave to quote what Mr. McAulay has said on another occasion (see 
Phil. Mag., June 1892):—“Quaternions differ in an important respect 
from other branches of mathematics that are studied by 
mathematicians after they have in the course of years of hard labour laid 
the foundation of all their future work. In nearly all cases these 
branches are very properly so called. They each grow out of a 
definite spot of the main tree of mathematics, and derive their 
sustenance from the sap of the trunk as a whole. But not so with 
quaternions. To let these grow in the brain of a mathematician, 
he must start from the seed as with the rest of his mathematics 
regarded as a whole. He cannot graft them on his already flourishing 
tree, for they will die there. They are independent plants that 
require separate sowing and the consequent careful tending.” 

Can we wonder that mathematicians, physicists, astronomers, and 
geometers feel some doubt as to the value or necessity of something 
so separate from all other branches of learning? Can that be a 
natural treatment of the subject which has no relations to any other 
method, and, as one might suppose from reading some treatises, has 
only occurred to a single man? Or, at best, is it not discouraging 
to be told that in order to use the quaternionic method, one must 
give up the progress which he has already made in the pursuit of 
his favourite science, and go back to the beginning and start anew 
on a parallel course ? 

I believe, however, that if what I have quoted is true of vector 
methods, it is because there is something fundamentally wrong in 
the presentation of the subject. Of course, in some sense and to some 
extent it is and must be true. Whatever is special, accidental, and 
individual, will die, as it should; but that which is universal and 
essential should remain as an organic part of the whole intellectual 
acquisition. If that which is essential dies with the accidental, it 
must be because the accidental has been given the prominence which 
belongs to the essential. For myself, I should preach no such doctrine 
to those whom I wish to convert to the true faith. 

In Italy, they say, all roads lead to Rome. In mechanics, 
kinematics, astronomy, physics, all study leads to the consideration of 
certain relations and operations. These are the capital notions; these 
should have the leading parts in any analysis suited to the subject. 

If I wished to attract the student of any of these sciences to an 
algebra for vectors, I should tell him that the fundamental notions 
of this algebra were exactly those with which he was daily 
conversant. I should tell him that a vector algebra is so far from being 
any one man’s production that half a century ago several were already 
working toward an algebra which should be primarily geometrical and 
not arithmetical, and that there is a remarkable similarity in the results 
to which these efforts led (see Proc. A.A.A.S. for 1886, pp. 37, ff.) [this 
vol. p. 91, ff.]. I should call his attention to the fact that Lagrange 
and Gauss used the notation $(\alpha\beta\gamma)$ to denote precisely the same as 
Hamilton by his $S(\alpha\beta\gamma)$, except that Lagrange limited the expression 
to unit vectors, and Gauss to vectors of which the length is the secant 
of the latitude, and I should show him that we have only to give up 
these limitations, and the expression (in connection with the notion 
of geometrical addition) is endowed with an immense wealth of 
transformations. I should call his attention to the fact that the 
notation $[r_1 r_2]$, universal in the theory of orbits, is identical with 
Hamilton’s $V(\rho_1\rho_2)$, except that Hamilton takes the area as a vector, 
i.e., includes the notion of the direction of the normal to the plane of 
the triangle, and that with this simple modification (and with the 
notion of geometrical addition of surfaces as well as of lines) this 
expression becomes closely connected with the first-mentioned, and is 
not only endowed with a similar capability for transformation, but 
enriches the first with new capabilities. In fact, I should tell him 
that the notions which we use in vector analysis are those which he 
who reads between the lines will meet on every page of the great 
masters of analysis, or of those who have probed deepest the secrets of 
nature, the only difference being that the vector analyst, having regard 
to the weakness of the human intellect, does as the early painters 
who wrote beneath their pictures “This is a tree,” “This is a horse.” 

I cannot attach quite so much importance as Mr. McAulay to 
uniformity of notation. That very uniformity, if it existed among 
those who use a vector analysis, would rather obscure than reveal 
their connection with the general course of modern thought in 
mathematics and physics. There are two ways in which we may 
measure the progress of any reform. The one consists in counting 
those who have adopted the shibboleth of the reformers; the other 
measure is the degree in which the community is imbued with the 
essential principles of the reform. I should apply the broader 
measure to the present case, and do not find it quite so bad as 
Mr. McAulay does. 

Yet the question of notations, although not the vital question, is 
certainly important, and I assure Mr. McAulay that reluctance to 
make unnecessary innovations in notation has been a very powerful 
motive in restraining me from publication. Indeed my pamphlet on 
Vector Analysis, which has excited the animadversion of 
quaternionists, was never formally published, although rather widely 
distributed, so long as I had copies to distribute, among those who 
I thought might be interested in the subject. I may say, however, 
since I am called upon to defend my position, that I have found 
the notations of that pamphlet more flexible than those generally 
used. Mr. McAulay, at least, will understand what I mean by this, 
if I say that some of the relations which he has thought of sufficient 
importance to express by means of special devices (see Proc. R.S.E.
for 1890-91), may be expressed at least as briefly in the notations
which I have used, and without special devices. But I should not
have been satisfied for the purposes of my pamphlet with any
notation which should suggest even to the careless reader any connection
with the notion of the quaternion. For I confess that one
of my objects was to show that a system of vector analysis does not
require any support from the notion of the quaternion, or, I may
add, of the imaginary in algebra. 

I should hardly dare to express myself with so much freedom, if 
I could not shelter myself behind an authority which will not be 
questioned. 

I do not see that I have done anything very different from what
the eminent mathematician upon whom Hamilton’s mantle has fallen 
has been doing, it would seem, unconsciously. Contrast the system of 
quaternions, which he has described in his sketch of Hamilton’s life 
and work in the North British Review for September, 1866, with the
system which he urges upon the attention of physicists in the Philosophical
Magazine in 1890. In 1866 we have a great deal about
imaginaries, and nearly as much about the quaternion. In 1890 we 
have nothing about imaginaries, and little about the quaternion. 
Prof. Tait has spoken of the calculus of quaternions as throwing off 
in the course of years its early Cartesian trammels. I wonder that 
he does not see how well the progress in which he has led may be
described as throwing off the yoke of the quaternion. A characteristic
example is seen in the use of the symbol $\nabla$. Hamilton applies this to
a vector to form a quaternion, Tait to form a linear vector function. 
But while breathing a new life into the formulae of quaternions, 
Prof. Tait stands stoutly by the letter. 

Now I appreciate and admire the generous loyalty toward one 
whom he regards as his master, which has always led Prof. Tait to 
minimise the originality of his own work in regard to quaternions, 
and write as if everything was contained in the ideas which flashed
into the mind of Hamilton at the classic Brougham Bridge. But not 
to speak of other claims of historical justice, we owe duties to our 
scholars as well as to our teachers, and the world is too large, and the 
current of modern thought is too broad, to be confined by the ipse 
dixit even of a Hamilton. 
QUATERNIONS AND VECTOR ANALYSIS.

[Nature, vol. xlviii, pp. 364-367, Aug. 17, 1893.]

In a paper by Prof. C. G. Knott on “Recent Innovations in Vector
Theory,” of which an abstract has been given in Nature (vol. xlvii,
pp. 590-593; see also a minor abstract on p. 287), the doctrine that
the quaternion affords the only sufficient and proper basis for vector
analysis is maintained by arguments based so largely on the faults
and deficiencies which the author has found in my pamphlet, Elements
of Vector Analysis, as to give to such faults an importance which
they would not otherwise possess, and to make some reply from me
necessary, if I would not discredit the cause of non-quaternionic
vector analysis. Especially is this true in view of the warm commendation
and endorsement of the paper, by Prof. Tait, which
appeared in Nature somewhat earlier (p. 225).

The charge which most requires a reply is expressed most distinctly 
in the minor abstract, viz., “that in the development of his dyadic
notation, Prof. Gibbs, being forced to bring the quaternion in, logically
condemned his own position.” This was incomprehensible to me until
I received the original paper, where I found the charge specified as
follows: “Although Gibbs gets over a good deal of ground without
the explicit recognition of the complete product, which is the difference 
of his ‘skew’ and ‘direct’ products, yet even he recognises in plain 
language the versorial character of a vector, brings in the quaternion 
whose vector is the difference of a linear vector function and its 
conjugate, and does not hesitate to use the accursed thing itself in 
certain line, surface, and volume integrals” (Proc. R.S.E., Session 
1892-3, p. 236). These three specifications I shall consider in their 
inverse order, premising, however, that the epitheta ornantia are
entirely my critic’s. 

The last charge is due entirely to an inadvertence. The integrals
referred to are those given at the close of the major abstract in
Nature (p. 593). My critic, in his original paper, states quite correctly
that, according to my definitions and notations, they should represent
dyadics. He multiplies them into a vector, introducing the vector 
under the integral sign, as is perfectly proper, provided, of course, 
that the vector is constant. But failing to observe this restriction, 
evidently through inadvertence, and finding that the resulting
equations (thus interpreted) would not be true, he concludes that I
must have meant something else by the original equations. Now,
these equations will hold if interpreted in the quaternionic sense, as is,
indeed, a necessary consequence of their holding in the dyadic sense,
although the converse would not be true. My critic was thus led,
in consequence of the inadvertence mentioned, to suppose that I had
departed from my ordinary usage and my express definitions, and
had intended the products in these integrals to be taken in the
quaternionic sense. This is the sole ground for the last charge.

The second charge evidently relates to the notations $\Phi_s$ and $\Phi_\times$ (see
Nature, vol. xlvii, p. 592). It is perfectly true that I have used a 
scalar and a vector connected with the linear vector operator, which, 
if combined, would form a quaternion. I have not thus combined 
them. Perhaps Prof. Knott will say that since I use both of them it 
matters little whether I combine them or not. If so I heartily agree 
with him. 

The first charge is a little vague. I certainly admit that vectors
may be used in connection with and to represent rotations. I have no
objection to calling them in such cases versorial. In that sense
Lagrange and Poinsot, for example, used versorial vectors. But what
has this to do with quaternions? Certainly Lagrange and Poinsot
were not quaternionists.

The passage in the major abstract in Nature which most distinctly
charges me with the use of the quaternion is that in which a certain
expression which I use is said to represent the quaternion operator
$q(\ )q^{-1}$ (vol. xlvii, p. 592). It would be more accurate to say that
my expression and the quaternionic expression represent the same
operator. Does it follow that I have used a quaternion? Not at all.
A quaternionic expression may represent a number. Does everyone
who uses any expression for that number use quaternions? A
quaternionic expression may represent a vector. Does everyone who
uses any expression for that vector use quaternions? A quaternionic
expression may represent a linear vector operator. If I use an
expression for that linear vector operator do I therefore use quaternions?
My critic is so anxious to prove that I use quaternions that
he uses arguments which would prove that quaternions were in
common use before Hamilton was born.
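The point that two different expressions may represent one and the same operator can be illustrated numerically. The following is only a sketch under stated assumptions: the function names, the sample axis, angle, and vector are hypothetical choices made for illustration, and the matrix form used is the familiar Rodrigues rotation formula in modern notation, not the dyadic expression of the pamphlet. It compares the result of $q(\ )q^{-1}$ with that of a rotation operator built without any quaternion.

```python
import math

def qmul(p, q):
    """Hamilton product of two quaternions given as (w, x, y, z)."""
    pw, px, py, pz = p
    qw, qx, qy, qz = q
    return (
        pw * qw - px * qx - py * qy - pz * qz,
        pw * qx + px * qw + py * qz - pz * qy,
        pw * qy + py * qw + pz * qx - px * qz,
        pw * qz + pz * qw + px * qy - py * qx,
    )

def rotate_by_quaternion(axis, angle, v):
    """Apply q ( ) q^{-1} to the vector v, with q the unit quaternion for axis and angle."""
    n = math.sqrt(sum(a * a for a in axis))
    ux, uy, uz = (a / n for a in axis)
    c, s = math.cos(angle / 2), math.sin(angle / 2)
    q = (c, s * ux, s * uy, s * uz)
    q_inv = (c, -s * ux, -s * uy, -s * uz)  # for a unit quaternion the inverse is the conjugate
    _, x, y, z = qmul(qmul(q, (0.0, *v)), q_inv)
    return (x, y, z)

def rotation_matrix(axis, angle):
    """Rodrigues formula: the same rotation as a 3x3 array, with no quaternion involved."""
    n = math.sqrt(sum(a * a for a in axis))
    ux, uy, uz = (a / n for a in axis)
    c, s = math.cos(angle), math.sin(angle)
    t = 1.0 - c
    return [
        [c + ux * ux * t, ux * uy * t - uz * s, ux * uz * t + uy * s],
        [uy * ux * t + uz * s, c + uy * uy * t, uy * uz * t - ux * s],
        [uz * ux * t - uy * s, uz * uy * t + ux * s, c + uz * uz * t],
    ]

def apply_matrix(m, v):
    """Apply a 3x3 operator to a vector."""
    return tuple(sum(m[i][j] * v[j] for j in range(3)) for i in range(3))

# Hypothetical sample data, chosen only for the comparison.
axis, angle, v = (1.0, 2.0, 3.0), 0.7, (0.3, -1.2, 2.0)
via_quaternion = rotate_by_quaternion(axis, angle, v)
via_matrix = apply_matrix(rotation_matrix(axis, angle), v)
print(via_quaternion)
print(via_matrix)
```

The two printed vectors agree to rounding error, although only the first computation makes any use of a quaternion.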

So much for the alleged use of the quaternion in my pamphlet. 
Let us now consider the faults and deficiencies which have been found 
therein and attributed to the want of the quaternion. The most
serious criticism in this respect relates to certain integrating operators,
which Prof. Tait unites with Prof. Knott in ridiculing. As definitions
are wearisome, I will illustrate the use of the terms and notations
which I have used by quoting a sentence addressed to the British
Association a few years ago. The speaker was Lord Kelvin. 

“ Helmholtz first solved the problem—Given the spin in any case 
of liquid motion, to find the motion. His solution consists in finding 
the potentials of three ideal distributions of gravitational matter 
having densities respectively equal to $1/\pi$ of the rectangular components
of the given spin; and, regarding for a moment these
potentials as rectangular components of velocity in a case of liquid 
motion, taking the spin in this motion as the velocity in the required 
motion” (Nature, vol. xxxviii, p. 569).

In the terms and notations of my pamphlet the problem and 
solution may be thus expressed : 

Given the curl in any case of liquid motion—to find the motion. 

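The required velocity is $1/4\pi$ of the curl of the potential of the
given curl.

Or, more briefly: the required velocity is $1/4\pi$ of the Laplacian
of the given curl.

Or in purely analytical form: required $\omega$ in terms of $\nabla \times \omega$, when

$$\nabla \cdot \omega = 0.$$

Solution: $$\omega = \frac{1}{4\pi}\, \nabla \times \mathrm{Pot}\, \nabla \times \omega = \frac{1}{4\pi}\, \mathrm{Lap}\, \nabla \times \omega.$$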

(The Laplacian expresses the result of an operation like that by 
which magnetic force is calculated from electric currents distributed 
in space. This corresponds to the second form in which Helmholtz 
expressed his result.) 

To show the incredible rashness of my critics, I will remark that 
these equations are among those of which it is said in the original 
paper (Proc. R.S.E., Session 1892-93, p. 225), “ Gibbs gives a good 
many equations—theorems I suppose they ape at being.” I may add 
that others of the equations thus characterized are associated with 
names not less distinguished than that of Helmholtz. But that to
which I wish especially to call attention is that the terms and 
notations in question express exactly the notions which physicists 
want to use. 

But we are told (Nature, vol. xlvii, p. 287) that these integrating
operators (Pot, Lap) are best expressed as inverse functions of $\nabla$.
To see how utterly inadequate the Nabla would have been to express 
the idea, we have only to imagine the exclamation points which the 
members of the British Association would have looked at each other 
if the distinguished speaker had said: 

Helmholtz first solved the problem—Given the Nabla of the velocity 
in any case of liquid motion, to find the velocity. His solution was 
that the velocity was the Nabla of the inverse square of Nabla of 
the Nabla of the velocity. Or, that the velocity was the inverse 
Nabla of the Nabla of the velocity. 
Or, if the problem and solution had been written thus: Required $\omega$
in terms of $\nabla\omega$ when $S\nabla\omega = 0$.

Solution: $$\omega = \nabla \nabla^{-2} \nabla \omega = \nabla^{-1} \nabla \omega.$$

My critic has himself given more than one example of the unfitness 
of the inverse Nabla for the exact expression of thought. For 
example, when he says that I have taken “eight distinct steps to 
prove two equations, which are special cases of 

I do not quite know what he means. If he means that I have taken 
eight steps to prove Poisson’s Equation (which certainly is not expressed
by the equation cited, although it may perhaps be associated
with it in some minds), I will only say that my proof is not very 
long, especially as I have aimed at greater rigor than is usually 
thought necessary. I cannot, however, compare my demonstration 
with that of quaternionic writers, as I have not been able (doubtless
on account of insufficient search) to find any such. 

To show how little foundation there is for the charge that the 
deficiencies of my system require to be pieced out by these integral
operators, I need only say that if I wished to economise operators I
might give up New, Lap, and Max, writing for them $\nabla\,\mathrm{Pot}$, $\nabla \times \mathrm{Pot}$,
and $\nabla \cdot \mathrm{Pot}$, and if I wished further to economise in what costs so little,
I could give up the potential also by using the notation $(\nabla \cdot \nabla)^{-1}$ or
$\nabla^{-2}$. That is, I could have used this notation without greater sacrifice
of precision than quaternionic writers seem to be willing to make. I
much prefer, however, to avoid these inverse operators as essentially 
indefinite. 

Nevertheless, although my critic has greatly obscured the subject
by ridiculing operators, which I beg leave to maintain are not worthy
of ridicule, and by thoughtlessly asserting that it was necessary for
me to use them, whereas they are only necessary for me in the sense
in which something of the kind is necessary for the quaternionist also,
if he would use a notation irreproachable on the score of exactness,
I desire to be perfectly candid. I do not wish to deny that the
relations connected with these notations appear a little more simple
in the quaternionic form. I had, indeed, this subject principally in
mind when I said two years ago in Nature (vol. xliii, p. 512) [this vol.
p. 158]: “There are a few formulae in which there is a trifling gain
in compactness in the use of the quaternion.” Let us see exactly
how much this advantage amounts to. 

There is nothing which the most rigid quaternionist need object to
in the notation for the potential, or indeed for the Newtonian. These
represent respectively the operations by which the potential or the
force of gravitation is calculated from the density of matter. A
quaternionist would, however, apply the operator New not only to a
scalar, as I have done, but to a vector also. The vector part of New $\omega$
(construed in the quaternionic sense) would be exactly what I have
represented by Lap $\omega$, and the scalar part, taken negatively, would be
exactly what I have represented by Max $\omega$. The quaternionist has
here a slight economy in notations, which is of less importance, since
all the operators (New, Lap, Max) may be expressed without
ambiguity in terms of the potential, which is therefore the only one
necessary for the exact expression of thought.

But what are the formulae which it is necessary for one to remember 
who uses my notations ? Evidently only those which contain the 
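operator Pot. For all the others are derived from these by the simple
substitutions

$$\mathrm{New} = \nabla\,\mathrm{Pot}, \qquad \mathrm{Lap} = \nabla \times \mathrm{Pot}, \qquad \mathrm{Max} = \nabla \cdot \mathrm{Pot}.$$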
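Written out as integrals over all space, these substitutions may be sketched as follows. This is an illustration in modern notation and not the typography of the pamphlet: the position vectors $\mathbf{r}$, $\mathbf{r}'$ and the volume element $dv'$ are assumed symbols, and the differentiation is assumed to pass under the integral sign, as it does when the density is well behaved.

```latex
\begin{aligned}
\mathrm{Pot}\,u(\mathbf{r}) &= \iiint \frac{u(\mathbf{r}')}{|\mathbf{r}'-\mathbf{r}|}\,dv', \\
\mathrm{New}\,u = \nabla\,\mathrm{Pot}\,u
  &= \iiint \frac{\mathbf{r}'-\mathbf{r}}{|\mathbf{r}'-\mathbf{r}|^{3}}\,u(\mathbf{r}')\,dv', \\
\mathrm{Lap}\,\omega = \nabla\times\mathrm{Pot}\,\omega
  &= \iiint \frac{\mathbf{r}'-\mathbf{r}}{|\mathbf{r}'-\mathbf{r}|^{3}}\times\omega(\mathbf{r}')\,dv', \\
\mathrm{Max}\,\omega = \nabla\cdot\mathrm{Pot}\,\omega
  &= \iiint \frac{\mathbf{r}'-\mathbf{r}}{|\mathbf{r}'-\mathbf{r}|^{3}}\cdot\omega(\mathbf{r}')\,dv'.
\end{aligned}
```

Each of the last three lines follows from the first by differentiating $1/|\mathbf{r}'-\mathbf{r}|$ with respect to $\mathbf{r}$, which gives $(\mathbf{r}'-\mathbf{r})/|\mathbf{r}'-\mathbf{r}|^{3}$.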

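Whether one is quaternionist or not, one must remember Poisson’s
Equation, which I write

$$\nabla \cdot \nabla\, \mathrm{Pot}\, \omega = -4\pi\omega,$$

and in quaternionic might be written

$$\nabla^2\, \mathrm{Pot}\, \omega = 4\pi\omega.$$

If $\omega$ is a vector, in using my equations one has also to remember the
general formula,

$$\nabla \cdot \nabla \omega = \nabla \nabla \cdot \omega - \nabla \times \nabla \times \omega,$$

which as applied to the present case may be united with the preceding
in the three-membered equation,

$$\nabla \cdot \nabla\, \mathrm{Pot}\, \omega = \nabla \nabla \cdot \mathrm{Pot}\, \omega - \nabla \times \nabla \times \mathrm{Pot}\, \omega = -4\pi\omega.$$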

This single equation is absolutely all that there is to burden the
memory of the student, except that the symbols of differentiation
($\nabla$, $\nabla \times$, $\nabla \cdot$) may be placed indifferently before or after the symbol
for the potential, and that if we choose we may substitute as above
New for $\nabla\,\mathrm{Pot}$, etc. Of course this gives a good many equations,
which on account of the importance of the subject (as they might
almost be said to give the mathematics of the electro-magnetic field)
I have written out more in detail than might seem necessary. I have
also called the attention of the student to many things, which perhaps
he might be left to himself to see. Prof. Knott says that the quaternionist
obtains similar equations by the simplest transformations.
He has failed to observe that the same is true in my Vector Analysis,
when once I have proved Poisson’s Equation. Perhaps he takes his
model of brevity from Prof. Tait, who simplifies the subject, I believe,
in his treatise on Quaternions, by taking this theorem for granted.
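The general formula connecting the three second-order operators (the Laplacian of a vector equals the gradient of its divergence less the curl of its curl) may be checked numerically. The sketch below is illustrative only: the vector field, the sample point, and the step size are arbitrary assumptions, and the derivatives are approximated by central differences rather than computed analytically.

```python
H = 1e-3  # step for central differences (an arbitrary choice for this sketch)

def field(x, y, z):
    """Hypothetical smooth vector field, used only for the check."""
    return (x * x * y + z ** 3, y * z * z - x ** 3, x * y * z + y * y)

def d(f, axis, p):
    """Central-difference partial derivative of the scalar function f along one axis at p."""
    lo, hi = list(p), list(p)
    lo[axis] -= H
    hi[axis] += H
    return (f(*hi) - f(*lo)) / (2 * H)

def comp(F, i):
    """The i-th rectangular component of the vector function F, as a scalar function."""
    return lambda x, y, z: F(x, y, z)[i]

def grad(f):
    return lambda x, y, z: tuple(d(f, i, (x, y, z)) for i in range(3))

def div(F):
    return lambda x, y, z: sum(d(comp(F, i), i, (x, y, z)) for i in range(3))

def curl(F):
    def c(x, y, z):
        p = (x, y, z)
        fx, fy, fz = comp(F, 0), comp(F, 1), comp(F, 2)
        return (
            d(fz, 1, p) - d(fy, 2, p),
            d(fx, 2, p) - d(fz, 0, p),
            d(fy, 0, p) - d(fx, 1, p),
        )
    return c

def laplacian(F):
    """Divergence of the gradient, taken component by component."""
    return lambda x, y, z: tuple(div(grad(comp(F, i)))(x, y, z) for i in range(3))

point = (0.4, -0.7, 1.1)
lhs = laplacian(field)(*point)
grad_div = grad(div(field))(*point)
curl_curl = curl(curl(field))(*point)
rhs = tuple(g - c for g, c in zip(grad_div, curl_curl))
print(lhs)
print(rhs)
```

For this polynomial field the two printed triples agree to within the rounding error of the difference quotients.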

Nevertheless, since I am forced so often to disagree with Prof. 
Knott, I am glad to agree with him when I can. He says in his 
original paper (p. 226), “No finer argument in favour of the real
quaternion vector analysis can be found than in the tangle and the
jangle of sections 91 to 104 in the Elements of Vector Analysis.”
Now I am quite ready to plead guilty to the tangle. The sections
mentioned, as is sufficiently evident to the reader, were written at
two different times, sections 102-104 being an addition after a couple
of years. The matter of these latter sections is not found in its 
natural place, and the result is well enough characterised as a tangle.
It certainly does credit to the conscientious study which Prof. Knott
has given to my pamphlet, that he has discovered that there is a 
violent dislocation of ideas just at this point. For such a fault of 
composition I have no sufficient excuse to offer, but I must protest
against its being made the ground of any broad conclusions in regard 
to the fundamental importance of the quaternion. 

Prof. Knott next proceeds to criticise—or at least to ridicule—my 
treatment of the linear vector function, with respect to which we read 
in the abstract: “As developed in the pamphlet, the theory of the
dyadic goes over much the same ground as is traversed in the last
chapter of Kelland and Tait’s Introduction to Quaternions. With
the exception of a few of those lexicon products, for which Prof. 
Gibbs has such an affection, there is nothing of real value added to 
our knowledge of the linear vector function.” It would not, I think, 
be difficult to show some inaccuracy in my critic’s characterisation 
of the real content of this part of my pamphlet. But as algebra is 
a formal science, and as the whole discussion is concerning the best 
form of representing certain kinds of relations, the important question 
would seem to be whether there is anything of formal value in my 
treatment of the linear vector function. 

Now Prof. Knott distinctly characterises in half a dozen words the 
difference in the spirit and method of my treatment of this subject 
from that which is traditional among quaternionists, when he says 
of what I have called dyadics: “these are not quantities, but operators”
(Nature, vol. xlvii, p. 592). I do not think that I applied the word
quantity to the dyadics, but Prof. Knott recognised that I treated 
them as quantities—not, of course, as the quantities of arithmetic, or 
of ordinary algebra, but as quantities in the broader sense, in which, 
for example, quaternions are called quantities. The fact that they 
may be operators does not prevent this. Just as in grammar verbs 
may be taken as substantives, viz., in the infinitive mood, so in algebra 
operators—especially such as are capable of quantitative variation— 
may be regarded as quantities when they are made the subject of 
algebraic comparison or operation. Now I would not say that it is 
necessary to treat every kind of operator as quantity, but I certainly 
think that one so important as the linear vector operator, and one 
which lends itself so well to such broader treatment, is worthy of it.
Of course, when vectors are treated by the methods of ordinary
algebra, linear vector operators will naturally be treated by the same
methods, but in an algebra formed for the sake of expressing the
relations between vectors, and in which vectors are treated as multiple 
quantities, it would seem an incongruity not to apply the methods 
of multiple algebra also to the linear vector operator. 

The dyadic is practically the linear vector operator regarded as 
quantity. More exactly it is the multiple quantity of the ninth order 
which affords various operators according to the way in which it is 
applied. I will not venture to say what ought to be included in a 
treatise on quaternions, in which, of course, a good many subjects
would have claims prior to the linear vector operator; but for the 
purposes of my pamphlet, in which the linear vector operator is one 
of the most important topics, I cannot but regard a treatment like 
that in Hamilton’s Lectures, or Elements, as wholly inadequate on
the formal side. To show what I mean, I have only to compare 
Hamilton’s treatment of the quaternion and of the linear vector 
operator with respect to notations. Since quaternions have been 
identified with matrices, while the linear vector operator evidently 
belongs to that class of multiple quantities, it seems unreasonable to 
refuse to the one those notations which we grant to the other. Thus, 
if the quaternionist has $e^q$, $\log q$, $\sin q$, $\cos q$, why should not the vector
analyst have $e^\Phi$, $\log \Phi$, $\sin \Phi$, $\cos \Phi$, where $\Phi$ represents a linear vector
operator? I suppose the latter are at least as useful to the physicist.
I mention these notations first, because here the analogy is most
evident. But there are other cases far more important, because more
elementary, in which the analogy is not so near the surface, and
therefore the difference in Hamilton’s treatment of the two kinds of
multiple quantity not so evident. We have, for example, the tensor
of the quaternion, which has the important property represented by
the equation: $$T(qr) = Tq\,Tr.$$

There is a scalar quantity related to the linear vector operator,
which I have represented by the notation $|\Phi|$ and called the
determinant of $\Phi$. It is in fact the determinant of the matrix by
which $\Phi$ may be represented, just as the square of the tensor of $q$
(sometimes called the norm of $q$) is the determinant of the matrix by
which $q$ may be represented. It may also be defined as the product
of the latent roots of $\Phi$, just as the square of the tensor of $q$ might be
defined as the product of the latent roots of $q$. Again, it has the
property represented by the equation

$$|\Phi \cdot \Psi| = |\Phi|\,|\Psi|,$$

which corresponds exactly with the preceding equation with both
sides squared.
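The parallel between the tensor of a quaternion and the determinant of a linear vector operator can be verified on particular numbers. What follows is a sketch under assumptions: the sample quaternions and 3x3 arrays are arbitrary, the function names are hypothetical, and the 2x2 complex array is one common matrix representation of a quaternion, adopted here only for illustration.

```python
import math

def qmul(p, q):
    """Hamilton product of two quaternions given as (w, x, y, z)."""
    pw, px, py, pz = p
    qw, qx, qy, qz = q
    return (
        pw * qw - px * qx - py * qy - pz * qz,
        pw * qx + px * qw + py * qz - pz * qy,
        pw * qy + py * qw + pz * qx - px * qz,
        pw * qz + pz * qw + px * qy - py * qx,
    )

def tensor(q):
    """Tensor (modulus) of a quaternion."""
    return math.sqrt(sum(c * c for c in q))

def as_complex_matrix(q):
    """A 2x2 complex matrix representing w + xi + yj + zk (one standard choice)."""
    w, x, y, z = q
    return [[complex(w, x), complex(y, z)],
            [complex(-y, z), complex(w, -x)]]

def matmul(a, b):
    """Product of two square arrays of equal size."""
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def det(m):
    """Determinant of a 2x2 or 3x3 array."""
    if len(m) == 2:
        return m[0][0] * m[1][1] - m[0][1] * m[1][0]
    return (m[0][0] * (m[1][1] * m[2][2] - m[1][2] * m[2][1])
            - m[0][1] * (m[1][0] * m[2][2] - m[1][2] * m[2][0])
            + m[0][2] * (m[1][0] * m[2][1] - m[1][1] * m[2][0]))

# Hypothetical sample quaternions and linear vector operators.
q, r = (1.0, 2.0, -1.0, 0.5), (0.3, -1.0, 2.0, 1.5)
phi = [[2, 1, 0], [0, 1, 3], [1, 0, 1]]
psi = [[1, 0, 2], [-1, 1, 0], [0, 2, 1]]

print(tensor(qmul(q, r)), tensor(q) * tensor(r))     # T(qr) and Tq Tr
print(det(as_complex_matrix(q)), tensor(q) ** 2)     # determinant and squared tensor
print(det(matmul(phi, psi)), det(phi) * det(psi))    # determinant of a product
```

In each printed pair the two values coincide, up to rounding in the first two lines.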
There is another scalar quantity connected with the quaternion and
represented by the notation $Sq$. It has the important property
expressed by the equation,

$$S(qrs) = S(rsq) = S(sqr),$$

and so for products of any number of quaternions, in which the cyclic
order remains unchanged. In the theory of the linear vector operator
there is an important quantity which I have represented by the
notation $\Phi_s$, and which has the property represented by the equation

$$(\Phi \cdot \Psi \cdot \Omega)_s = (\Psi \cdot \Omega \cdot \Phi)_s = (\Omega \cdot \Phi \cdot \Psi)_s,$$

where the number of the factors is as before immaterial. $\Phi_s$ may be
defined as the sum of the latent roots of $\Phi$, just as $2Sq$ may be
defined as the sum of the latent roots of $q$.
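The cyclic property of the scalar part of a quaternion product, and the corresponding property of the scalar of a linear vector operator, may be checked in the same manner. This is a sketch under assumptions: the scalar of the operator is taken to be the sum of the diagonal elements of its matrix (the trace, which equals the sum of the latent roots), and the sample quaternions and arrays are arbitrary.

```python
def qmul(p, q):
    """Hamilton product of two quaternions given as (w, x, y, z)."""
    pw, px, py, pz = p
    qw, qx, qy, qz = q
    return (
        pw * qw - px * qx - py * qy - pz * qz,
        pw * qx + px * qw + py * qz - pz * qy,
        pw * qy + py * qw + pz * qx - px * qz,
        pw * qz + pz * qw + px * qy - py * qx,
    )

def scalar_part(q):
    """Sq, the scalar part of a quaternion."""
    return q[0]

def matmul(a, b):
    """Product of two square arrays of equal size."""
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def trace(m):
    """Sum of the diagonal elements, assumed here to stand for the scalar of the operator."""
    return sum(m[i][i] for i in range(len(m)))

# Hypothetical sample data.
q, r, s = (1, 2, -1, 3), (0, -1, 2, 1), (2, 1, 1, -2)
phi = [[2, 1, 0], [0, 1, 3], [1, 0, 1]]
psi = [[1, 0, 2], [-1, 1, 0], [0, 2, 1]]
omega = [[0, 1, 1], [2, 0, -1], [1, 1, 3]]

quaternion_scalars = (
    scalar_part(qmul(qmul(q, r), s)),
    scalar_part(qmul(qmul(r, s), q)),
    scalar_part(qmul(qmul(s, q), r)),
)
operator_scalars = (
    trace(matmul(matmul(phi, psi), omega)),
    trace(matmul(matmul(psi, omega), phi)),
    trace(matmul(matmul(omega, phi), psi)),
)
print(quaternion_scalars)
print(operator_scalars)
```

Within each printed triple the three entries are equal, the cyclic order of the factors being all that is preserved.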

The analogy of these notations may be further illustrated by
comparing the equations

$$(T e^q)^2 = e^{2Sq}$$

and $$|e^\Phi| = e^{\Phi_s}.$$

I do not see why it is not as reasonable for the vector analyst to 
have notations like $|\Phi|$ and $\Phi_s$, as for the quaternionist to have the
notations $Tq$ and $Sq$.

This is of course an argumentum ad quaternionisten. I do not
pretend that it gives the reason why I used these notations, for the
identification of the quaternion with a matrix was, I think, unknown
to me when I wrote my pamphlet. The real justification of the
notations $|\Phi|$ and $\Phi_s$ is that they express functions of the linear
vector operator quâ quantity, which physicists and others have
continually occasion to use. And this justification applies to other
notations which may not have their analogues in quaternions. Thus
I have used $\Phi_\times$ to express a vector so important in the theory of the
linear vector operator, that it can hardly be neglected in any treatment
of the subject. It is described, for example, in treatises as different as
Thomson and Tait’s Natural Philosophy and Kelland and Tait’s
Quaternions. In the former treatise the components of the vector
are, of course, given in terms of the elements of the linear vector
operator, which is in accordance with the method of the treatise. In
the latter treatise the vector is expressed by

$$V\alpha\alpha' + V\beta\beta' + V\gamma\gamma'.$$

As this supposes the linear vector operator to be given not by a
single letter, but by several vectors, it must be regarded as entirely 
inadequate by any one who wishes to treat the subject in the spirit of 
multiple algebra, i.e. to use a single letter to represent the linear 
vector operator. 
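That this vector belongs to the operator as a whole, and not to any particular set of vectors by which the operator happens to be given, may be seen in a small computation. The sketch assumes that the operator is the sum of three dyads, each formed from a pair of vectors, with matrix elements equal to the sums of products of their components; the pairs themselves and the function names are hypothetical. The vector is obtained once from the pairs, as a sum of vector products, and once from the matrix elements alone.

```python
def cross(a, b):
    """Vector product of two 3-vectors."""
    return (
        a[1] * b[2] - a[2] * b[1],
        a[2] * b[0] - a[0] * b[2],
        a[0] * b[1] - a[1] * b[0],
    )

def dyad_sum_matrix(pairs):
    """Matrix of the operator formed as a sum of dyads: element (j, k) is the sum of a[j] * b[k]."""
    return [[sum(a[j] * b[k] for a, b in pairs) for k in range(3)] for j in range(3)]

def vector_from_pairs(pairs):
    """Sum of the vector products of each pair (the form used when the pairs are given)."""
    total = (0, 0, 0)
    for a, b in pairs:
        total = tuple(t + c for t, c in zip(total, cross(a, b)))
    return total

def vector_from_matrix(m):
    """The same vector computed from the single operator, without reference to the pairs."""
    return (m[1][2] - m[2][1], m[2][0] - m[0][2], m[0][1] - m[1][0])

# Hypothetical pairs of vectors.
pairs = [((1, 0, 2), (0, 1, 1)), ((-1, 3, 0), (2, 0, 1)), ((0, 1, 1), (1, 1, 0))]
print(vector_from_pairs(pairs))
print(vector_from_matrix(dyad_sum_matrix(pairs)))
```

Both lines print the same vector, so that a notation attached to the single letter for the operator loses nothing that the expression by pairs contains.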

But my critic does not like the notations $|\Phi|$, $\Phi_s$, $\Phi_\times$. His ridicule,
indeed, reaches high-water mark in the paragraphs in which he mentions
them. Concerning another notation, $\Phi\,{}^{\times}_{\times}\,\Phi$ (defined in Nature, vol.
xliii, p. 513) [this vol., p. 160], he exclaims, “Thus burden after burden,
in the form of new notation, is added apparently for the sole purpose
of exercising the faculty of memory.” He would vastly prefer, it
would appear, to write with Hamilton “$m\phi'^{-1}$, where $m$ represents
what the unit volume becomes under the influence of the linear
operator.” But this notation is only apparently compact, since the
$m$ requires explanation. Moreover, if a strain were given in what
Hamilton calls the standard trinomial form, to write out the formula
for the operator on surfaces in that standard form by the use of the
expression $m\phi'^{-1}$ would require, it seems to me, ten (if not fifty) times
the effort of memory and of ingenuity, which would be required for
the same purpose with the use of $\Phi\,{}^{\times}_{\times}\,\Phi$.

I may here remark that Prof. Tait’s letter of endorsement of Prof.
Knott’s paper affords a striking illustration of the convenience and
flexibility of a notation entirely analogous to $\Phi\,{}^{\times}_{\times}\,\Phi$, viz., $\Phi : \Phi$. He
gives the form $S\nabla\nabla_1\, S\sigma\sigma_1$ to illustrate the advantage of quaternionic
notations in point of brevity. If I understand his notation, this is
what I should write $\nabla\sigma : \nabla\sigma$. (I take for granted that the suffixes
indicate that $\nabla$ applies as differential operator to $\sigma$, and $\nabla_1$ to $\sigma_1$,
$\sigma$ and $\sigma_1$ being really identical in meaning, as also $\nabla$ and $\nabla_1$.) It
will be observed that in my notation one dot unites in multiplication
the two $\nabla$’s, and the other the two $\sigma$’s, and that I am able to leave
each $\nabla$ where it naturally belongs as differential operator. The
quaternionist cannot do this, because the $\nabla$ and $\sigma$ cannot be left
together without uniting to form a quaternion, which is not at all
wanted. Moreover, I can write $\Phi$ for $\nabla\sigma$, and $\Phi : \Phi$ for $\nabla\sigma : \nabla\sigma$. The
quaternionist also uses a $\phi$, which is practically identical with my
$\Phi$ (viz., the operator which expresses the relation between $d\sigma$ and $d\rho$),
but I do not see how Prof. Knott, who I suppose dislikes $\Phi : \Phi$ as
much as $\Phi\,{}^{\times}_{\times}\,\Phi$, would express $S\nabla\nabla_1\, S\sigma\sigma_1$ in terms of this $\phi$.

It is characteristic of Prof. Knott’s view of the subject, that in 
translating into quaternionic from a dyadic, or operator, as he calls it, 
he adds in each case an operand. In many cases it would be difficult 
to make the translation without this. But it is often a distinct 
advantage to be able to give the operator without the operand. For 
example, in translating into quaternionic my dyadic or operator $\Phi \times \rho$,
he adds an operand, and exclaims, “ The old thing ! ” Certainly, when 
this expression is applied to an operand, there is no advantage (and 
no disadvantage) in my notation as compared with the quaternionic. 
But if the quaternionist wished to express what I would write in the 
form $(\Phi \times \rho)^{-1}$, or $|\Phi \times \rho|$, or $(\Phi \times \rho)_s$, or $(\Phi \times \rho)_\times$, he would, I think,
find the operand very much in the way.
ON DOUBLE REFRACTION AND THE DISPERSION OF
COLORS IN PERFECTLY TRANSPARENT MEDIA. 

[American Journal of Science, ser. 3, vol. XXIII, pp. 262-275, April, 1882.]

1. In calculating the velocity of a system of plane waves of homogeneous
light, regarded as oscillating electrical fluxes, in transparent
and sensibly homogeneous bodies, whether singly or doubly refracting, 
we may assume that such a body is a very fine-grained structure, so 
that it can be divided into parts having their dimensions very small 
in comparison with the wave-length, each of which may be regarded 
as entirely similar to every other, while in the interior of each there 
are wide differences in electrical as in other physical properties. 
Hence, the average electrical displacement in such parts of the body 
may be expressed as a function of the time and the coordinates of 
position by the ordinary equations of wave-motion, while the real 
displacement at any point will in general differ greatly from that 
represented by such equations. 

It is the object of this paper to investigate the velocity of light in 
perfectly transparent media which have not the property of circular 
polarization in a manner which shall take account of this difference 
between the real displacements and those represented by the ordinary 
equations of wave-motion. We shall find that this difference will 
account for the dispersion of colors, without affecting the validity 
of the laws of Huyghens and Fresnel for double refraction with 
respect to light of any one color. 

In this investigation, it is assumed that the electrical displacements 
are solenoidal, or, in other words, that they are such as not to produce 
any change in electrical density. The disturbance in the medium is 
treated as consisting entirely of such electrical displacements and 
fluxes, and not complicated by any distinctively magnetic phenomena. 
It might therefore be more accurate to call the theory (as here 
developed) electrical rather than electromagnetic. The latter term is 
nevertheless retained in accordance with general usage, and with that 
of the author of the theory. 

Since the velocity which we are seeking is equal to the wave-length 
divided by the period of oscillation, the problem reduces to finding 
the ratio of these quantities, and may be simplified in some respects 
by supposing that we have to do with a system of stationary waves.
That the relation of the wave-length and the period is the same for 
stationary as for progressive waves is evident from the consideration 
that a system of stationary waves may be formed by two systems of 
progressive waves having opposite directions. 
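The statement that a system of stationary waves may be formed from two systems of progressive waves may be checked numerically. In this sketch the amplitude, wave-length, period, and the sampled values of distance and time are arbitrary assumptions, and the function names are hypothetical. The stationary wave is taken as a product of two cosines, and each progressive wave as a single cosine travelling with velocity equal to the wave-length divided by the period.

```python
import math

def stationary(u, t, wavelength, period, amplitude=1.0):
    """Stationary wave: a product of a cosine in distance and a cosine in time."""
    return (amplitude
            * math.cos(2 * math.pi * u / wavelength)
            * math.cos(2 * math.pi * t / period))

def progressive(u, t, wavelength, period, direction, amplitude=1.0):
    """Progressive wave moving toward +u (direction=+1) or -u (direction=-1)."""
    return amplitude * math.cos(2 * math.pi * (u / wavelength - direction * t / period))

# Hypothetical wave-length and period; the common velocity is wavelength / period.
wavelength, period = 2.0, 0.5
samples = [(0.0, 0.0), (0.3, 0.1), (1.7, 0.42), (-0.9, 2.3)]
differences = []
for u, t in samples:
    forward = progressive(u, t, wavelength, period, +1)
    backward = progressive(u, t, wavelength, period, -1)
    # Half the sum of the two opposite progressive waves should equal the stationary wave.
    differences.append(0.5 * (forward + backward) - stationary(u, t, wavelength, period))
print(differences)
```

The printed differences vanish to rounding error, which is the identity cos A cos B = ½[cos(A − B) + cos(A + B)] applied to the two phases; the wave-length and period, and therefore their ratio, are the same in all three waves.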

2. Let $x$, $y$, $z$ be the rectangular coordinates of any point in the
medium, which with the system of waves we may regard as indefinitely
extended, and let $\xi + \xi'$, $\eta + \eta'$, $\zeta + \zeta'$ be the components of
electrical displacement at that point at the time $t$; $\xi$, $\eta$, $\zeta$ being the
average values of the components of electrical displacement at that
time in a wave-plane passing through the point. Then $\xi$, $\eta$, $\zeta$, $\xi'$, $\eta'$, $\zeta'$
are perfectly defined quantities, of which $\xi$, $\eta$, $\zeta$ are connected with
$x$, $y$, $z$, and $t$ by the ordinary equations of wave-motion, while each
of the quantities $\xi'$, $\eta'$, $\zeta'$ has always zero for its average value in any
wave-plane. We may call $\xi$, $\eta$, $\zeta$ the components of the regular part
of the displacement, and $\xi'$, $\eta'$, $\zeta'$ the components of the irregular
part of the displacement. In like manner, the differential coefficients
of these quantities with respect to the time, $\dot\xi$, $\dot\eta$, $\dot\zeta$, $\dot\xi'$, $\dot\eta'$, $\dot\zeta'$, may be
called respectively the components of the regular part of the flux,
and the components of the irregular part of the flux.

Let the whole space be divided into elements of volume Du, very 
small in all dimensions in comparison with a wave-length, but 
enclosing portions of the medium which may be treated as entirely 
similar to one another, and therefore not infinitely small. Thus a 
crystal may be divided into elementary parallelopipeds, all the vertices 
of which are similarly situated with respect to the internal structure 
of the crystal. Amorphous solids and liquids may not be capable of 
division into equally small portions of which physical similarity can 
be predicated with the same rigor. Yet we may suppose them capable 
of a division substantially satisfying the requirements. 

From these definitions it follows that at any given instant the
average value of each of the quantities $\xi'$, $\eta'$, $\zeta'$ in an element Du is
zero. For the average value in one such element must be sensibly
the same as in any other situated on the same wave-plane. If this 
average were not zero, the average for the wave-plane would not be 
zero. Moreover, at any given instant, the values of ri, ^ may be 
regarded as constant throughout any element Du, and as representing 
the average values of the components of displacement in that element. 
The same will be true of the quantities 7 ', f' and jy, f. 

3. Since we have excluded the case of media which have the property
of circular polarization, we shall not impair the generality of
our results if we suppose that we have to do with linearly polarized
light, i.e., that the regular part of the displacement is everywhere
parallel to the same fixed line, all cases not already excluded being
reducible to this. Then, with the origin of coordinates and the zero
of time suitably chosen, the regular part of the displacement may be
represented by the equations

$$
\begin{aligned}
\xi &= \alpha \cos 2\pi\frac{u}{l}\,\cos 2\pi\frac{t}{p},\\
\eta &= \beta \cos 2\pi\frac{u}{l}\,\cos 2\pi\frac{t}{p},\\
\zeta &= \gamma \cos 2\pi\frac{u}{l}\,\cos 2\pi\frac{t}{p},
\end{aligned}
\tag{1}
$$

where $l$ denotes the wave-length, $p$ the period of vibration,
$\alpha, \beta, \gamma$ the maximum amplitudes of the displacements
$\xi, \eta, \zeta$, and $u$ the distance of the point considered from
the wave-plane which passes through the origin. Since $u$ is a linear
function of $x, y$, and $z$, we may regard these equations as giving
the values of $\xi, \eta, \zeta$, for a given system of waves, in terms
of $x, y, z$, and $t$.

4. The components of the irregular displacement, $\xi', \eta', \zeta'$,
at any given point, will evidently be simple harmonic functions of the
time, having the same period as the regular part of the displacement.
That they will also have the same phase is not quite so evident, and
would not be the case in a medium in which there were any absorption or
dispersion of light. It will however appear from the following
considerations that in perfectly transparent media the irregular
oscillations are synchronous with the regular. For if they are not
synchronous, we may resolve the irregular oscillations into two parts,
of which one shall be synchronous with the regular oscillations, and
the other shall have a difference of phase of one-fourth of a complete
oscillation. Now if the medium is one in which there is no absorption
or dispersion of light, we may assume that the same electrical
configurations may also be passed through in the inverse order, which
would be represented analytically by writing $-t$ for $t$ in the
equations which give $\xi, \eta, \zeta, \xi', \eta', \zeta'$, as
functions of $x, y, z$, and $t$. But this
change would not affect the regular oscillations, nor the synchronous
part of the irregular oscillations, which depends on the cosine of the
time, while the non-synchronous part of the irregular oscillations,
which depends on the sine of the time, would simply have its
direction reversed. Hence, by taking first one-half the sum, and
secondly one-half the difference, of the original motion and that
obtained by substitution of $-t$ for $t$, we may separate the
non-synchronous part of the irregular oscillations from the rest of the
motion. Therefore, the supposed non-synchronous part of the irregular
displacement, if capable of existence, is at least wholly independent
of the wave-motion and need not be considered by us.
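
The separation by half-sum and half-difference may be illustrated
numerically. What follows is only a sketch under stated assumptions:
the period `p` and the amplitudes `c` and `s` of the synchronous and
non-synchronous parts are hypothetical values chosen for illustration,
and the function names are not taken from the paper.

```python
import math

def split_by_time_reversal(f, t):
    """Half-sum and half-difference of a motion f(t) and its reversal f(-t)."""
    half_sum = 0.5 * (f(t) + f(-t))    # part unchanged by writing -t for t
    half_diff = 0.5 * (f(t) - f(-t))   # part reversed by writing -t for t
    return half_sum, half_diff

# Hypothetical values, for illustration only.
p = 2.0          # period of oscillation
c, s = 0.7, 0.3  # amplitudes of the cosine (synchronous) and sine parts

def irregular_displacement(t):
    """One component of the irregular displacement at a fixed point."""
    phase = 2.0 * math.pi * t / p
    return c * math.cos(phase) + s * math.sin(phase)

t = 0.37
synchronous, non_synchronous = split_by_time_reversal(irregular_displacement, t)
print(synchronous, non_synchronous)
```

The half-sum retains only the cosine term and the half-difference only
the sine term, which is the separation used in the argument above.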

We may go farther in the determination of the quantities
$\xi', \eta', \zeta'$. For in view of the very fine-grained structure
of the medium, it will easily appear that the manner in which the
general or average flux in any element Dv (represented by
$\dot\xi, \dot\eta, \dot\zeta$) distributes itself among
the molecules and intermolecular spaces must be entirely determined
by the amount and direction of that flux and its period of oscillation.
Hence, and on account of the superposable character of the motions
which we are considering, we may conclude that the values of
$\xi', \eta', \zeta'$ at any given point in the medium are capable of
expression as linear functions of $\xi, \eta, \zeta$ in a manner which
shall be independent of the time and of the orientation of the
wave-planes and the distance of a nodal plane from the point
considered, so long as the period of oscillation remains the same. But
a change in the period may presumably affect the relation between
$\xi', \eta', \zeta'$ and $\xi, \eta, \zeta$ to a certain extent. And
the relation between $\xi', \eta', \zeta'$ and $\xi, \eta, \zeta$ will
vary rapidly as we pass from one point to another within the element
Dv.

5. In the motion which we are considering there occur alternately 
instants of no velocity and instants of no displacement. The statical 
energy of the medium at an instant of no velocity must be equal to 
its kinetic energy at an instant of no displacement. Let us examine 
each of these quantities, and consider the equation which expresses
their equality.

6. Since in every part of an element Dv the irregular as well as the
regular part of the displacement is entirely determined (for light of
a given period) by the values of $\xi, \eta, \zeta$, the statical
energy of the element must be a quadratic function of
$\xi, \eta, \zeta$, say

$$(A\xi^2 + B\eta^2 + C\zeta^2 + E\eta\zeta + F\zeta\xi + G\xi\eta)\,Dv,$$

where A, B, etc. depend only on the nature of the medium and the
period of oscillation. At an instant of no velocity, when

$$\sin 2\pi\frac{t}{p} = 0, \quad\text{and}\quad \cos^2 2\pi\frac{t}{p} = 1,$$

the above expression will reduce by equations (1) to

$$(A\alpha^2 + B\beta^2 + C\gamma^2 + E\beta\gamma + F\gamma\alpha + G\alpha\beta)\cos^2 2\pi\frac{u}{l}\,Dv.$$

Since the average value of $\cos^2 2\pi\frac{u}{l}$ in an indefinitely
extended space is $\tfrac12$, we have for the statical energy in a
unit of volume

$$S = \tfrac12\,(A\alpha^2 + B\beta^2 + C\gamma^2 + E\beta\gamma + F\gamma\alpha + G\alpha\beta). \tag{2}$$

7. The kinetic energy of the whole medium is represented by the
double volume-integral*

$$\tfrac12\,\Sigma \iint \frac{(\dot\xi+\dot\xi')_1\,(\dot\xi+\dot\xi')_2}{r}\,dv_1\,dv_2,$$

* The fluxes are supposed to be measured by the electromagnetic system of units. It
is to be observed that the difference of opinion which has prevailed with respect to the
estimation of the energy of electrical currents does not extend to such as are solenoidal,
which may be regarded as composed of closed circuits.

where $dv_1, dv_2$ are two infinitesimal elements of volume,
$(\dot\xi+\dot\xi')_1$, $(\dot\xi+\dot\xi')_2$
the corresponding components of flux, $r$ the distance between
the elements, and $\Sigma$ denotes a summation with respect to the
coordinate axes. Separating the integrations, we may write for
the same quantity

$$\tfrac12\,\Sigma \int (\dot\xi+\dot\xi')_2 \left[\int \frac{(\dot\xi+\dot\xi')_1}{r}\,dv_1\right] dv_2.$$

It is evident that the integral within the brackets is derived from
$\dot\xi+\dot\xi'$ by the same process by which the potential of any
mass is derived from its density. If we use the symbol Pot to express
this relation, we may write for the kinetic energy

$$\tfrac12\,\Sigma \int (\dot\xi+\dot\xi')\,\mathrm{Pot}\,(\dot\xi+\dot\xi')\,dv.$$

The operation denoted by this symbol is evidently distributive,
so that $\mathrm{Pot}\,(\dot\xi+\dot\xi') = \mathrm{Pot}\,\dot\xi + \mathrm{Pot}\,\dot\xi'$.
The expression for the kinetic
energy may therefore be expanded into

$$\tfrac12\Sigma\!\int \dot\xi\,\mathrm{Pot}\,\dot\xi\,dv + \tfrac12\Sigma\!\int \dot\xi\,\mathrm{Pot}\,\dot\xi'\,dv + \tfrac12\Sigma\!\int \dot\xi'\,\mathrm{Pot}\,\dot\xi\,dv + \tfrac12\Sigma\!\int \dot\xi'\,\mathrm{Pot}\,\dot\xi'\,dv.$$

But $\dot\xi'$, and therefore $\mathrm{Pot}\,\dot\xi'$, has in every
wave-plane the average value zero. Also $\dot\xi$, and therefore
$\mathrm{Pot}\,\dot\xi$, has in every wave-plane a
constant value. Therefore the second and third integrals in the
above expression will vanish, leaving for the kinetic energy

$$\tfrac12\Sigma\!\int \dot\xi\,\mathrm{Pot}\,\dot\xi\,dv + \tfrac12\Sigma\!\int \dot\xi'\,\mathrm{Pot}\,\dot\xi'\,dv, \tag{3}$$

which is to be calculated for a time of no displacement, when

$$
\dot\xi = \pm\frac{2\pi\alpha}{p}\cos 2\pi\frac{u}{l}, \quad
\dot\eta = \pm\frac{2\pi\beta}{p}\cos 2\pi\frac{u}{l}, \quad
\dot\zeta = \pm\frac{2\pi\gamma}{p}\cos 2\pi\frac{u}{l}.
\tag{4}
$$


The form of the expression (3) indicates that the kinetic energy 
consists of two parts, one of which is determined by the regular part 
of the flux, and the other by the irregular part of the flux. 

8. The value of $\mathrm{Pot}\,\dot\xi$ may be easily found by
integration, but perhaps more readily by Poisson's well-known theorem,
that if $q$ is any function of position in space (as the density of a
certain mass),

$$\frac{d^2\,\mathrm{Pot}\,q}{dx^2} + \frac{d^2\,\mathrm{Pot}\,q}{dy^2} + \frac{d^2\,\mathrm{Pot}\,q}{dz^2} = -4\pi q, \tag{5}$$

where the direction of the coordinate axes is immaterial, provided
that they are rectangular. In applying this to $\mathrm{Pot}\,\dot\xi$,
we may place two of the axes in a wave-plane. This will give

$$\frac{d^2\,\mathrm{Pot}\,\dot\xi}{du^2} = -4\pi\dot\xi. \tag{6}$$


In a nodal plane, $\mathrm{Pot}\,\dot\xi = 0$, since $\dot\xi$ has equal
positive and negative values in elements of volume symmetrically
distributed with respect to any point in such a plane. In a wave-crest
(or plane in which $\dot\xi$ has a maximum value),
$\mathrm{Pot}\,\dot\xi$ will also have a maximum value,
which we may call K. For intermediate points we may determine
its value from the consideration that the total disturbance may be
resolved into two systems of waves, one having a wave-crest, and the
other a nodal plane passing through the point for which the potential
is sought. The maximum amplitudes of these component systems
will be to the maximum amplitude of the original system as
$\cos 2\pi\frac{u}{l}$ and $\sin 2\pi\frac{u}{l}$ to unity. But the
second of the component systems will contribute nothing to the value
of the potential. We thus obtain

$$\mathrm{Pot}\,\dot\xi = K\cos 2\pi\frac{u}{l},$$

$$\frac{d^2\,\mathrm{Pot}\,\dot\xi}{du^2} = -\frac{4\pi^2}{l^2}\,K\cos 2\pi\frac{u}{l} = -\frac{4\pi^2}{l^2}\,\mathrm{Pot}\,\dot\xi.$$

Comparing this with equation (6), we have

$$\mathrm{Pot}\,\dot\xi = \frac{l^2}{\pi}\,\dot\xi. \tag{7}$$

Hence, and by equations (4),

$$\tfrac12\Sigma\!\int \dot\xi\,\mathrm{Pot}\,\dot\xi\,dv = \frac{l^2}{2\pi}\Sigma\!\int \dot\xi^2\,dv = \frac{2\pi l^2}{p^2}(\alpha^2+\beta^2+\gamma^2)\int \cos^2 2\pi\frac{u}{l}\,dv.$$

The kinetic energy of the regular part of the flux is therefore, for
each unit of volume,

$$T = \frac{\pi l^2}{p^2}(\alpha^2+\beta^2+\gamma^2). \tag{8}$$
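
The relation between the potential and the flux, and the resulting
energy per unit of volume, can be checked numerically. The sketch below
is an illustration only: it assumes the plane-wave form of equation (4),
takes $\mathrm{Pot}\,\dot\xi = (l^2/\pi)\,\dot\xi$ as the relation to be
tested against Poisson's theorem along $u$, and uses hypothetical
values for `l`, `p`, and the amplitudes. All function names are
invented for the example.

```python
import math

def pot_of_plane_wave(amplitude, l, u):
    """Pot of q = amplitude*cos(2*pi*u/l), assumed equal to (l**2/pi)*q."""
    return (l ** 2 / math.pi) * amplitude * math.cos(2.0 * math.pi * u / l)

def second_derivative(f, u, h=1e-4):
    """Central-difference estimate of d^2 f / du^2."""
    return (f(u + h) - 2.0 * f(u) + f(u - h)) / h ** 2

def regular_kinetic_energy(alpha, beta, gamma, l, p):
    """pi * l^2 * (alpha^2 + beta^2 + gamma^2) / p^2, per unit of volume."""
    return math.pi * l ** 2 * (alpha ** 2 + beta ** 2 + gamma ** 2) / p ** 2

def averaged_half_flux_times_pot(amplitudes, l, p, samples=2000):
    """Average over one wave-length of (1/2) * sum(flux * Pot flux)."""
    total = 0.0
    for i in range(samples):
        u = (i + 0.5) * l / samples            # midpoint rule
        for a in amplitudes:
            flux_amp = 2.0 * math.pi * a / p   # flux amplitude, as in equation (4)
            flux = flux_amp * math.cos(2.0 * math.pi * u / l)
            total += 0.5 * flux * pot_of_plane_wave(flux_amp, l, u)
    return total / samples

# Hypothetical numbers, for illustration only.
l, p = 1.5, 0.8
alpha, beta, gamma = 0.2, 0.5, 0.1
u0 = 0.31

# Poisson's theorem along u: d^2 Pot q / du^2 = -4*pi*q.
lhs = second_derivative(lambda u: pot_of_plane_wave(1.0, l, u), u0)
rhs = -4.0 * math.pi * math.cos(2.0 * math.pi * u0 / l)

energy_numeric = averaged_half_flux_times_pot((alpha, beta, gamma), l, p)
energy_formula = regular_kinetic_energy(alpha, beta, gamma, l, p)
print(lhs, rhs, energy_numeric, energy_formula)
```

The averaged integrand and the closed form agree because the mean of
$\cos^2 2\pi\frac{u}{l}$ over a wave-length is one-half.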

9. With respect to the kinetic energy of the irregular part of the
flux, it is to be observed that, since
$\dot\xi', \dot\eta', \dot\zeta'$ have their average values
zero in spaces which are very small in comparison with a wave-length,
the integrations implied in the notations $\mathrm{Pot}\,\dot\xi'$,
$\mathrm{Pot}\,\dot\eta'$, $\mathrm{Pot}\,\dot\zeta'$ may be
confined to a sphere of a radius which is small in comparison with a
wave-length. Since within such a sphere
$\dot\xi', \dot\eta', \dot\zeta'$ are sensibly determined by the values
of $\dot\xi, \dot\eta, \dot\zeta$ at the center of the sphere, which is
the point for which the values of the potentials are sought,
$\mathrm{Pot}\,\dot\xi'$, $\mathrm{Pot}\,\dot\eta'$,
$\mathrm{Pot}\,\dot\zeta'$ must be functions (evidently linear
functions) of $\dot\xi, \dot\eta, \dot\zeta$; and
$\dot\xi'\,\mathrm{Pot}\,\dot\xi'$, $\dot\eta'\,\mathrm{Pot}\,\dot\eta'$,
$\dot\zeta'\,\mathrm{Pot}\,\dot\zeta'$ must be quadratic functions of
the same quantities. But these functions will vary with the position
of the point considered with reference to the adjacent molecules.

Now the expression for the kinetic energy of the irregular part
of the flux,

$$\tfrac12\Sigma\!\int \dot\xi'\,\mathrm{Pot}\,\dot\xi'\,dv,$$

indicates that we may regard the infinitesimal element $dv$ as having
the energy (due to this part of the flux)

$$\tfrac12\Sigma\,\dot\xi'\,\mathrm{Pot}\,\dot\xi'\,dv.$$

Let us consider the energy due to the irregular flux which will belong
to the above defined element Dv, which is not infinitely small, but
which has the advantage of being one of physically similar elements
which make up the whole medium. The energy of this element is
found by adding the energies of all the infinitesimal elements of
which it is composed. Since these are quadratic functions of the
quantities $\dot\xi, \dot\eta, \dot\zeta$, which are sensibly constant
throughout the element Dv, the sum will be a quadratic function of
$\dot\xi, \dot\eta, \dot\zeta$, say

$$(A'\dot\xi^2 + B'\dot\eta^2 + C'\dot\zeta^2 + E'\dot\eta\dot\zeta + F'\dot\zeta\dot\xi + G'\dot\xi\dot\eta)\,Dv,$$

which will therefore represent the energy of the element Dv due to
the irregular flux. The coefficients A', B', etc., are determined by the
nature of the medium and the period of oscillation. They will be
constant throughout the medium, since one element Dv does not differ
from another.

This expression reduces by equations (4) to

$$\frac{4\pi^2}{p^2}(A'\alpha^2 + B'\beta^2 + C'\gamma^2 + E'\beta\gamma + F'\gamma\alpha + G'\alpha\beta)\cos^2 2\pi\frac{u}{l}\,Dv.$$

The kinetic energy of the irregular flux in a unit of volume is
therefore

$$T' = \frac{2\pi^2}{p^2}(A'\alpha^2 + B'\beta^2 + C'\gamma^2 + E'\beta\gamma + F'\gamma\alpha + G'\alpha\beta). \tag{9}$$

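10. Equating the statical and kinetic energies, we have

$$
\begin{aligned}
&\tfrac12(A\alpha^2 + B\beta^2 + C\gamma^2 + E\beta\gamma + F\gamma\alpha + G\alpha\beta)\\
&\quad = \frac{\pi l^2}{p^2}(\alpha^2+\beta^2+\gamma^2) + \frac{2\pi^2}{p^2}(A'\alpha^2 + B'\beta^2 + C'\gamma^2 + E'\beta\gamma + F'\gamma\alpha + G'\alpha\beta).
\end{aligned}
\tag{10}
$$

The velocity (V) of the corresponding system of progressive waves is
given by the equation

$$
V^2 = \frac{l^2}{p^2}
= \frac{1}{2\pi}\,\frac{A\alpha^2 + B\beta^2 + C\gamma^2 + E\beta\gamma + F\gamma\alpha + G\alpha\beta}{\alpha^2+\beta^2+\gamma^2}
- \frac{2\pi}{p^2}\,\frac{A'\alpha^2 + B'\beta^2 + C'\gamma^2 + E'\beta\gamma + F'\gamma\alpha + G'\alpha\beta}{\alpha^2+\beta^2+\gamma^2}.
\tag{11}
$$

If we set

$$a = \frac{A}{2\pi} - \frac{2\pi A'}{p^2}, \quad b = \frac{B}{2\pi} - \frac{2\pi B'}{p^2}, \quad \text{etc.}, \tag{12}$$

and

$$\rho^2 = \alpha^2+\beta^2+\gamma^2,$$

the equation reduces to

$$V^2 = \frac{a\alpha^2 + b\beta^2 + c\gamma^2 + e\beta\gamma + f\gamma\alpha + g\alpha\beta}{\rho^2}. \tag{13}$$

For a given medium and light of a given period, the coefficients a, b,
etc., are constant.

This relation between the velocity of the waves and the direction of
oscillation is capable of a very simple geometrical expression. Let $r$
be the radius vector of the ellipsoid

$$ax^2 + by^2 + cz^2 + eyz + fzx + gxy = 1. \tag{14}$$

Then

$$\frac{1}{r^2} = a\frac{x^2}{r^2} + b\frac{y^2}{r^2} + c\frac{z^2}{r^2} + e\frac{yz}{r^2} + f\frac{zx}{r^2} + g\frac{xy}{r^2}.$$

If this radius is drawn parallel to the electrical oscillations, we shall
have

$$\frac{x}{r} = \frac{\alpha}{\rho}, \quad \frac{y}{r} = \frac{\beta}{\rho}, \quad \frac{z}{r} = \frac{\gamma}{\rho},$$

and

$$V^2 = \frac{1}{r^2}. \tag{15}$$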


That is, the wave-velocity for any particular direction of oscillation 
is represented in the ellipsoid by the reciprocal of the radius vector 
which is parallel to that direction. 
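
The reciprocal relation between wave-velocity and radius vector can be
illustrated with a few lines of code. This is a minimal sketch, not
part of the paper: the six coefficients are hypothetical (chosen so
that the quadratic form is positive, as an ellipsoid requires), and the
function names are invented for the example.

```python
import math

def quadratic_form(coeffs, v):
    """a*x^2 + b*y^2 + c*z^2 + e*y*z + f*z*x + g*x*y for v = (x, y, z)."""
    a, b, c, e, f, g = coeffs
    x, y, z = v
    return a * x * x + b * y * y + c * z * z + e * y * z + f * z * x + g * x * y

def wave_velocity(coeffs, oscillation):
    """V from equation (13), for amplitudes (alpha, beta, gamma)."""
    rho_squared = sum(component ** 2 for component in oscillation)
    return math.sqrt(quadratic_form(coeffs, oscillation) / rho_squared)

def radius_vector(coeffs, direction):
    """Length r of the radius of the ellipsoid (14) parallel to `direction`."""
    norm = math.sqrt(sum(component ** 2 for component in direction))
    unit = tuple(component / norm for component in direction)
    return 1.0 / math.sqrt(quadratic_form(coeffs, unit))

# Hypothetical coefficients (a, b, c, e, f, g) and direction of oscillation.
coeffs = (3.0, 2.0, 1.0, 0.2, 0.1, 0.3)
oscillation = (1.0, 2.0, 2.0)

V = wave_velocity(coeffs, oscillation)
r = radius_vector(coeffs, oscillation)
print(V, r, V * r)
```

The product `V * r` comes out as unity for any direction of
oscillation, which is the geometrical statement made above.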

11 . This relation between the wave-length, the period, and the 
direction of vibration, must hold true not only of such vibrations as 
actually occur, but also of such as we may imagine to occur under 
the influence of constraints determining the direction of vibration in 
the wave-plane. The directions of the natural or unconstrained 
vibrations in any wave-plane may be determined by the general 
mechanical principle that if the type of a natural vibration is infini¬ 
tesimally altered by the application of a constraint, the value of the 
period will be stationary.* Hence, in a system of stationary waves 
such as we have been considering, if the direction of an unconstrained 
vibration is infinitesimally varied in its wave-plane by a constraint 
while the wave-length remains constant, the period will be stationary.
Therefore, if the direction of the unconstrained vibration is
infinitesimally varied by constraint, and the period remains rigorously
constant, the wave-length will be stationary. Hence, if we make a
central section of the above described ellipsoid parallel to any
wave-plane, the directions of natural vibration for that wave-plane
will be parallel to the radii vectores of stationary value in that
section, viz., to the
axes of the ellipse, when the section is elliptical, or to all radii, when 
the section is circular. 
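
The construction of the natural directions of vibration from a central
section of the ellipsoid may be sketched as follows. This is an
illustration under stated assumptions, not the author's procedure: the
coefficients and the wave-normal are hypothetical, the helper names are
invented, and the stationary radii are found by the usual
$\tan 2\theta$ rule for the axes of an ellipse.

```python
import math

def dot(u, v):
    return sum(a * b for a, b in zip(u, v))

def cross(u, v):
    return (u[1] * v[2] - u[2] * v[1],
            u[2] * v[0] - u[0] * v[2],
            u[0] * v[1] - u[1] * v[0])

def normalize(v):
    norm = math.sqrt(dot(v, v))
    return tuple(c / norm for c in v)

def apply_form(coeffs, v):
    """Multiply the symmetric matrix of the quadratic form (13) by v."""
    a, b, c, e, f, g = coeffs
    x, y, z = v
    return (a * x + 0.5 * g * y + 0.5 * f * z,
            0.5 * g * x + b * y + 0.5 * e * z,
            0.5 * f * x + 0.5 * e * y + c * z)

def natural_vibrations(coeffs, wave_normal):
    """Return [(direction, velocity), ...] for the two natural vibrations."""
    n = normalize(wave_normal)
    axes = ((1.0, 0.0, 0.0), (0.0, 1.0, 0.0), (0.0, 0.0, 1.0))
    helper = min(axes, key=lambda axis: abs(dot(axis, n)))
    # Orthonormal basis (e1, e2) of the wave-plane.
    e1 = normalize(tuple(h - dot(helper, n) * c for h, c in zip(helper, n)))
    e2 = cross(n, e1)
    q11 = dot(e1, apply_form(coeffs, e1))
    q12 = dot(e1, apply_form(coeffs, e2))
    q22 = dot(e2, apply_form(coeffs, e2))
    # Axes of the ellipse cut from the ellipsoid by the wave-plane.
    theta = 0.5 * math.atan2(2.0 * q12, q11 - q22)
    results = []
    for angle in (theta, theta + 0.5 * math.pi):
        d = tuple(math.cos(angle) * p + math.sin(angle) * q
                  for p, q in zip(e1, e2))
        velocity = math.sqrt(dot(d, apply_form(coeffs, d)))  # V = 1/r
        results.append((d, velocity))
    return results

# Hypothetical orthorhombic case: waves travelling along Z.
example = natural_vibrations((3.0, 2.0, 1.0, 0.0, 0.0, 0.0), (0.0, 0.0, 1.0))
print(example)
```

In this hypothetical case the two natural vibrations are parallel to X
and Y, the axes of the elliptical section made by the wave-plane.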

* See Rayleigh's Theory of Sound, vol. i, p. 84.

12. For light of a single period, our hypothesis has led to a
perfectly definite result, our equations expressing the fundamental
laws of double refraction as enunciated by Fresnel. But if we ask
how the velocity of light varies with the period, that is, if we seek
to derive from the same equations the laws of the dispersion of colors,
we shall not be able to obtain an equally definite result, since the
quantities A, B, etc., and A', B', etc., are unknown functions of the
period. If, however, we make the assumption, which is hardly likely
to be strictly accurate, but which may quite conceivably be not far
removed from the truth, that the manner in which the general or
average flux in any small part of the medium distributes itself among
the molecules and intermolecular spaces is independent of the period,
the quantities A, B, etc., and A', B', etc., will be constant, and we
obtain a very simple relation between V and p, which appears to agree
tolerably well with the results of experiment.


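If we set

$$H = \frac{A\alpha^2 + B\beta^2 + C\gamma^2 + E\beta\gamma + F\gamma\alpha + G\alpha\beta}{\alpha^2+\beta^2+\gamma^2}, \tag{16}$$

and

$$H' = \frac{A'\alpha^2 + B'\beta^2 + C'\gamma^2 + E'\beta\gamma + F'\gamma\alpha + G'\alpha\beta}{\alpha^2+\beta^2+\gamma^2}, \tag{17}$$

our general equation (11) becomes

$$V^2 = \frac{H}{2\pi} - \frac{2\pi H'}{p^2}, \tag{18}$$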


where H and H' will be constant for any given direction of oscillation,
when A, B, etc., and A', B', etc., are constant. If we wish to introduce
into the equation the absolute index of refraction ($n$) and the
wave-length in vacuo ($\lambda$) in place of V and $p$, we may divide
both sides of the equation by the square of the constant ($k$)
representing the velocity of light in vacuo. Then, since

$$\frac{V}{k} = \frac{1}{n}, \quad\text{and}\quad kp = \lambda,$$

our equation reduces to

$$\frac{1}{n^2} = \frac{H}{2\pi k^2} - \frac{2\pi H'}{\lambda^2}. \tag{19}$$

It is well known that the relation between $n$ and $\lambda$ may be
tolerably well but by no means perfectly represented by an equation of
this form.

13. If we now give up the presumably inaccurate supposition that
A, B, etc., and A', B', etc., are constant, equation (19) will still subsist,
but H and H' will not be constant for a given direction of oscillation,
but will be functions of $p$, or, what amounts to the same, of
$\lambda$. Although we cannot therefore use the equation to derive
a priori the relation between $n$ and $\lambda$, we may use it to
derive the values of H and H' from the empirically determined relation
between $n$ and $\lambda$. To do this, we must make use again of the
general principle that an infinitesimal variation in the type of a
vibration, due to a constraint, will not affect the period. If we first
consider a certain system of stationary waves, then a system in which
the wave-length is greater by an infinitesimal $dl$ (the direction of
oscillation remaining the same), the period will be increased by an
infinitesimal $dp$, and the manner in which the flux distributes itself
among the molecules and intermolecular spaces will presumably be
infinitesimally changed. But if we suppose that in the second system of
waves there is applied a constraint compelling the flux to distribute
itself in the same way among the molecules and intermolecular spaces as
in the first system (so that $\dot\xi', \dot\eta', \dot\zeta'$ shall be
the same functions as before of $\dot\xi, \dot\eta, \dot\zeta$, a
supposition perfectly compatible with the fact that the values of
$\dot\xi, \dot\eta, \dot\zeta$ are changed), this constraint, according
to the principle cited, will not affect the period of oscillation. Our
equations will apply to such a constrained type of oscillation, and
A, B, etc., and A', B', etc., and therefore H and H', will have the
same values in the last described system of waves as in the first
system, although the wave-length and the period have been varied.
Therefore, in differentiating equation (18), which is essentially an
equation between $l$ and $p$, or its equivalent (19), we may treat H
and H' as constant. This gives

$$-\frac{2}{n^3}\frac{dn}{d\lambda} = \frac{4\pi H'}{\lambda^3}.$$

We thus obtain the values of H' and H

$$H' = -\frac{\lambda^3}{2\pi n^3}\frac{dn}{d\lambda}, \qquad H = \frac{2\pi k^2}{n^2} - \frac{2\pi k^2\lambda}{n^3}\frac{dn}{d\lambda}. \tag{20}$$


By determining the values of H and H' for different directions of 
oscillation, we may determine the values of A, B, etc., and A', B', etc. 

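By means of these equations, the ratios of the statical energy (S),
the kinetic energy due to the regular part of the flux (T), and the
kinetic energy due to the irregular part of the flux (T'), are easily
obtained in a form which admits of experimental determination.
Equations (8) and (9) give

$$T = \frac{\pi l^2\rho^2}{p^2}, \qquad T' = \frac{2\pi^2 H'\rho^2}{p^2}.$$

Therefore, by (20),

$$\frac{T'}{T} = \frac{2\pi H'}{l^2} = \frac{2\pi H' n^2}{\lambda^2} = -\frac{\lambda}{n}\frac{dn}{d\lambda} = -\frac{d\log n}{d\log\lambda}, \tag{21}$$

$$\frac{S}{T} = \frac{T+T'}{T} = 1 + \frac{T'}{T} = \frac{d\log\lambda - d\log n}{d\log\lambda} = \frac{d\log l}{d\log\lambda}, \tag{22}$$

$$\frac{T'}{S} = -\frac{d\log n}{d\log l}. \tag{23}$$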


Since S, T, and T' are essentially positive quantities, their ratios
must be positive. Equation (21) therefore requires that the index
of refraction shall increase as the period or wave-length in vacuo
diminishes. Experiment has shown no exceptions to this rule, except
such as are manifestly attributable to the absorption of light.
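
How H, H', and the energy ratios follow from a measured dispersion
curve may be sketched numerically. The example below is illustrative
only and rests on assumptions not found in the paper: the index is
taken from a Cauchy-type formula $n = c_0 + c_1/\lambda^2$ with
hypothetical coefficients, $\lambda$ is in micrometres, and the
velocity of light in vacuo $k$ is set to unity.

```python
import math

def h_values(n, dn_dlam, lam, k=1.0):
    """H and H' from n, dn/d(lambda), lambda, assuming equation (19) holds."""
    h_prime = -(lam ** 3 / (2.0 * math.pi * n ** 3)) * dn_dlam
    h = (2.0 * math.pi * k ** 2 / n ** 2
         - 2.0 * math.pi * k ** 2 * lam / n ** 3 * dn_dlam)
    return h, h_prime

def energy_ratios(n, dn_dlam, lam):
    """Return (T'/T, S/T): irregular and statical energy relative to T."""
    irregular_over_regular = -(lam / n) * dn_dlam   # -d log n / d log lambda
    statical_over_regular = 1.0 + irregular_over_regular
    return irregular_over_regular, statical_over_regular

# Hypothetical dispersion formula (an assumption, for illustration only).
c0, c1 = 1.50, 0.004

def index(lam):
    return c0 + c1 / lam ** 2

def index_slope(lam):
    return -2.0 * c1 / lam ** 3

lam = 0.589
n = index(lam)
dn = index_slope(lam)
h, h_prime = h_values(n, dn, lam)
ratio_irregular, ratio_statical = energy_ratios(n, dn, lam)
print(h, h_prime, ratio_irregular, ratio_statical)
```

With an index that falls as the wave-length grows, the ratio T'/T
comes out positive, in agreement with the remark above that the index
must increase as the wave-length in vacuo diminishes.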

14. It remains to consider the relations between the optical properties
of a medium and the planes or axes of symmetry which it
may possess. If we consider the statical energy per unit of volume
(S) and the period as constant, we may regard equation (2) as the
equation of an ellipsoid, the radii vectores of which represent in
direction and magnitude the amplitudes of systems of waves having
the same statical energy. In like manner, if we consider the kinetic
energy of the irregular part of the flux per unit of volume (T') and
the period as constant, we may regard equation (9) as the equation
of an ellipsoid, the radii vectores of which represent in direction and
magnitude the amplitudes of systems of waves having the same
kinetic energy due to the irregular part of the flux. These ellipsoids,
which we may distinguish as the ellipsoids (A, B, etc.) and (A', B', etc.),
as well as the ellipsoid before described, which we may call the
ellipsoid (a, b, etc.), must be independent in their form and their
orientation of the directions of the axes of coordinates, being
determined entirely by the nature of the medium and the period of
oscillation. They must therefore possess the same kind of symmetry
as the internal structure of the medium.

If the medium is symmetrical about a certain axis, each ellipsoid 
must have an axis parallel to that. If the medium is symmetrical 
with respect to a certain plane, each ellipsoid must have an axis at 
right angles to that plane. If the medium after a revolution of less 
than 180° about a certain axis is then equivalent to the medium in 
its first position, or symmetrical with it with respect to a plane at
right angles to that axis, each ellipsoid must have an axis of revolution 
parallel to that axis. These relations must be the same for light of 
all colors, and also for all temperatures of the medium. 

15. From these principles we may infer the optical characteristics
of the different crystallographic systems.

In crystals of the isometric system, as in amorphous bodies, the
three ellipsoids reduce to spheres. Such media are optically isotropic
at least so far as any properties are concerned which come within the
scope of this paper.

In crystals of the tetragonal or hexagonal systems, the three
ellipsoids will have axes of rotation parallel to the principal
crystallographic axis. Since the ellipsoid (a, b, etc.) has but one
circular section, there will be but one optic axis, which will have a
fixed direction.

In crystals of the orthorhombic system, the three ellipsoids will
have their axes parallel to the rectangular crystallographic axes. If
we take these directions for the axes of coordinates, E, F, G, E', F', G',
e, f, g will vanish and equation (13) will reduce to

$$V^2 = \frac{a\alpha^2 + b\beta^2 + c\gamma^2}{\rho^2}.$$

If the coordinate axes are so placed that

$$a > b > c,$$

the optic axes will lie in the X-Z plane, making equal angles $\phi$
with the axis of Z, which may be determined by the equation

$$\tan^2\phi = \frac{a-b}{b-c} = \frac{p^2(A-B) - 4\pi^2(A'-B')}{p^2(B-C) - 4\pi^2(B'-C')}.$$

To get a rough idea of the manner in which $\phi$ varies with the
period, we may regard A, B, C, A', B', C' as constant in this equation.
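
The dependence of the optic-axis angle on the period may be explored
with a short calculation. This is a sketch under stated assumptions:
it uses $\tan^2\phi = (a-b)/(b-c)$ with $a = A/2\pi - 2\pi A'/p^2$, and
so on, as in equation (12), the numerical values of A, B, C, A', B',
C', and `p` are hypothetical, and the function names are invented for
the example.

```python
import math

def ellipsoid_coefficient(statical, kinetic, p):
    """a = A/(2*pi) - 2*pi*A'/p^2, and likewise for b and c."""
    return statical / (2.0 * math.pi) - 2.0 * math.pi * kinetic / p ** 2

def optic_axis_angle(a, b, c):
    """Angle phi (radians) between each optic axis and Z, for a > b > c."""
    if not a > b > c:
        raise ValueError("the axes must be ordered so that a > b > c")
    return math.atan(math.sqrt((a - b) / (b - c)))

# Hypothetical constants, for illustration only.
A, B, C = 40.0, 30.0, 20.0
A1, B1, C1 = 0.10, 0.05, 0.02   # stand-ins for A', B', C'
p = 1.0

a = ellipsoid_coefficient(A, A1, p)
b = ellipsoid_coefficient(B, B1, p)
c = ellipsoid_coefficient(C, C1, p)
phi = optic_axis_angle(a, b, c)
print(math.degrees(phi))
```

Repeating the calculation for several values of `p` gives the rough
idea of the dispersion of the optic axes spoken of above, so long as
the ordering a > b > c is preserved.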

But since the lengths of the axes of the ellipsoid (a, b, etc.) vary
with the period, it may easily happen that the order of the axes
with respect to magnitude is not the same for all colors. In that
case, the optic axes for certain colors will lie in one of the principal
planes, and for other colors in another. For the color at which the
change takes place, the two optic axes will coincide. The differential
coefficient $\frac{d\phi}{dp}$ becomes infinitely great as the optic
axes approach coincidence.

In crystals of the monoclinic system, each of the three ellipsoids
will have an axis perpendicular to the plane of symmetry. We may
choose this direction for the axis of X. Then F, G, F', G', f, g, will
vanish and equation (13) will reduce to

$$V^2 = \frac{a\alpha^2 + b\beta^2 + c\gamma^2 + e\beta\gamma}{\rho^2}.$$

The angle $\theta$ made by one of the axes of the ellipsoid (a, b, etc.)
in the plane of symmetry with the axis of Y and measured toward the
axis of Z, is determined by the equation

$$\tan 2\theta = \frac{e}{c-b} = \frac{p^2E - 4\pi^2E'}{p^2(C-B) - 4\pi^2(C'-B')}.$$

To get a rough idea of the dispersion of the axes of the ellipsoid
(a, b, etc.) in the plane of symmetry, we may regard B, C, E, B', C', E',
as constant in this equation, and suppose the axis of Y so placed as
to make E vanish.

It is evident that in this system the plane of the optic axes will be
fixed, or will rotate about one of the lines which bisect the angles
made by the optic axes, according as the mean axis of the ellipsoid
(a, b, etc.) is perpendicular to the plane of symmetry or lies in that
plane. In the first case the dispersion of the two optic axes will be
unequal. The same crystal, however, with light of different colors,
or at different temperatures, may afford an example of each case.

In crystals of the triclinic system, since the ellipsoids (A, B, etc.)
and (A', B', etc.) are determined by considerations of a different
nature, and there are no relations of symmetry to cause a coincidence
in the directions of their axes, there will not in general be any such
coincidence. Therefore the three axes of the ellipsoid (a, b, etc.), that
is, the two lines which bisect the angles of the optic axes and their
common normal, will vary in position with the color of the light.

16. It appears from the foregoing discussion that by the
electromagnetic theory of light we may not only account for the
dispersion of colors (including the dispersion of the lines which
bisect the angles of the optic axes in doubly refracting media), but
may also obtain Fresnel's laws of double refraction for every kind of
homogeneous light without neglect of the quantities which determine
the dispersion of colors.

But a closer approximation than that of this paper will be necessary
to explain the phenomena of circularly polarizing media, which
depend on very minute differences of wave-velocity, represented
perhaps by a few units in the sixth significant figure of the index
of refraction. That the degree of approximation which will give the
laws of circular and elliptic polarization will not add any terms to
the equations of this paper, except such as vanish for media which
do not exhibit this phenomenon, will be shown in another number
of this Journal.



XII. 


ON DOUBLE REFRACTION IN PERFECTLY TRANSPARENT 
MEDIA WHICH EXHIBIT THE PHENOMENA OF CIR- 
CULAR POLARIZATION. 


[American Journal of Science, ser. 3, vol. xxiii, pp. 460-476, June, 1882.]


1. In the April number of this Journal,* the velocity of propagation 
of a system of plane waves of light, regarded as oscillating electrical
fluxes, was discussed with such a degree of approximation as would 
account for the dispersion of colors and give Fresnel’s laws of double 
refraction. It is the object of this paper to supplement that discussion 
by carrying the approximation so much further as is necessary in 
order to embrace the phenomena of circularly polarizing media. 

2. If we imagine all the velocities in any progressive system of 
plane waves to be reversed at a given instant without affecting the 
displacements, and the system of wave-motion thus obtained to be 
superposed upon the original system, we obtain a system of stationary 
waves having the same wave-length and period of oscillation as the 
original progressive system. If we then reduce the magnitude of 
the displacements in the uniform ratio of two to one, they will be 
identical, at an instant of maximum displacement, with those of the 
original system at the same instant. 
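
The superposition described in this paragraph can be verified with a
few lines of code. The sketch assumes a progressive wave of the form
$\cos 2\pi(u/l - t/p)$ and takes $t = 0$ as the instant at which the
velocities are reversed; the numerical values and function names are
hypothetical and serve only for illustration.

```python
import math

def progressive(u, t, amplitude, l, p):
    """Original progressive wave."""
    return amplitude * math.cos(2.0 * math.pi * (u / l - t / p))

def reversed_wave(u, t, amplitude, l, p):
    """Same displacements at t = 0, with every velocity reversed."""
    return amplitude * math.cos(2.0 * math.pi * (u / l + t / p))

def stationary(u, t, amplitude, l, p):
    """Superposition of the original and the reversed system."""
    return (progressive(u, t, amplitude, l, p)
            + reversed_wave(u, t, amplitude, l, p))

# Hypothetical values, for illustration only.
amplitude, l, p = 0.4, 1.5, 0.8
u, t = 0.27, 0.11

product_form = (2.0 * amplitude
                * math.cos(2.0 * math.pi * u / l)
                * math.cos(2.0 * math.pi * t / p))
print(stationary(u, t, amplitude, l, p), product_form)
```

The sum has the product form of a stationary wave with the same
wave-length and period, and halving it reproduces the original
displacements at the instant $t = 0$.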

Following the same method as in the paper cited, let us especially
consider the system of stationary waves, and divide the whole
displacement into the regular part, represented by $\xi, \eta, \zeta$,
and the irregular part, represented by $\xi', \eta', \zeta'$, in
accordance with the definitions of § 2 of that paper.

* See page 182 of this volume.

3. The regular part of the displacement is subject to the equations
of wave-motion, which may be written (in the most general case
of plane stationary waves)

$$
\begin{aligned}
\xi &= \left(\alpha_1\cos 2\pi\frac{u}{l} + \alpha_2\sin 2\pi\frac{u}{l}\right)\cos 2\pi\frac{t}{p},\\
\eta &= \left(\beta_1\cos 2\pi\frac{u}{l} + \beta_2\sin 2\pi\frac{u}{l}\right)\cos 2\pi\frac{t}{p},\\
\zeta &= \left(\gamma_1\cos 2\pi\frac{u}{l} + \gamma_2\sin 2\pi\frac{u}{l}\right)\cos 2\pi\frac{t}{p},
\end{aligned}
\tag{1}
$$

where $l$ denotes the wave-length, $p$ the period of oscillation, $u$
the distance of the point considered from the wave-plane passing
through the origin, $\alpha_1, \beta_1, \gamma_1$ the amplitudes of the
displacements $\xi, \eta, \zeta$ in the wave-plane passing through the
origin, and $\alpha_2, \beta_2, \gamma_2$ their amplitudes in a
wave-plane one-quarter of a wave-length distant
and on the side toward which $u$ increases. If we also write L, M, N
for the direction-cosines of the wave-normal drawn in the direction
in which $u$ increases, we shall have the following necessary relations:

$$L^2 + M^2 + N^2 = 1, \tag{2}$$

$$u = Lx + My + Nz, \tag{3}$$

$$L\alpha_1 + M\beta_1 + N\gamma_1 = 0, \quad L\alpha_2 + M\beta_2 + N\gamma_2 = 0. \tag{4}$$

4. That the irregular part of the displacement ($\xi', \eta', \zeta'$)
at any given point is a simple harmonic function of the time, having
the same period and phase as the regular part of the displacement
($\xi, \eta, \zeta$), may be proved by the single principle of
superposition of motions, and is therefore to be regarded as exact in a
discussion of this kind. But the further conclusion of the preceding
paper (§ 4), "that the values of $\xi', \eta', \zeta'$ at any given
point in the medium are capable of expression as linear functions of
$\xi, \eta, \zeta$ in a manner which shall be independent of the time
and of the orientation of the wave-planes and the distance of a nodal
plane from the point considered, so long as the period of oscillation
remains the same," is evidently only approximative, although a very
close approximation. A very much closer approximation may be obtained,
if we regard $\xi', \eta', \zeta'$ at any given point of the medium and
for light of a given period, as linear functions of $\xi, \eta, \zeta$
and the nine differential coefficients

$$\frac{d\xi}{dx}, \quad \frac{d\eta}{dx}, \quad \frac{d\zeta}{dx}, \quad \frac{d\xi}{dy}, \quad \text{etc.}$$

We shall write $\xi, \eta, \zeta$ and diff. coeff. to denote these
twelve quantities.

From this it follows immediately that with the same degree of
approximation $\dot\xi', \dot\eta', \dot\zeta'$ may be regarded, for a
given point of the medium and light of a given period, as linear
functions of $\dot\xi, \dot\eta, \dot\zeta$ and the differential
coefficients of $\dot\xi, \dot\eta, \dot\zeta$ with respect to the
coordinates. For these twelve quantities we shall write
$\dot\xi, \dot\eta, \dot\zeta$ and diff. coeff.

5. Let us now proceed to equate the statical energy of the medium
at an instant of no velocity with its kinetic energy at an instant of 
no displacement. It will be convenient to estimate each of these 
quantities for a unit of volume. 

6. The statical energy of an infinitesimal element of volume may be
represented by $\sigma\,dv$, where $\sigma$ is a quadratic function of
the components of displacement $\xi+\xi'$, $\eta+\eta'$,
$\zeta+\zeta'$. Since for that element of volume
$\xi', \eta', \zeta'$ may be regarded as linear functions of
$\xi, \eta, \zeta$ and diff. coeff., we may regard $\sigma$ as a
quadratic function of $\xi, \eta, \zeta$ and diff. coeff., or
as a linear function of the seventy-eight squares and products of these
quantities. But the seventy-eight coefficients by which this function
is expressed will vary with the position of the element of volume with
respect to the surrounding molecules.
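
The count of seventy-eight is the number of distinct squares and
products that can be formed from twelve quantities, namely
$12 \cdot 13 / 2$. A short enumeration confirms it; the labels used
below for the twelve quantities are merely illustrative names.

```python
from itertools import combinations_with_replacement

components = ["xi", "eta", "zeta"]
coordinates = ["x", "y", "z"]

# Three components of displacement and their nine differential coefficients.
quantities = components + [f"d{c}/d{x}" for c in components for x in coordinates]

# Every unordered pair, a quantity paired with itself giving a square.
squares_and_products = list(combinations_with_replacement(quantities, 2))

print(len(quantities), len(squares_and_products))  # 12 78
```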

In estimating the statical energy for any considerable space by the
integral

$$\int \sigma\,dv,$$

it will be allowable to substitute for the seventy-eight coefficients
contained implicitly in $\sigma$ their average values throughout the
medium. That is, if we write $s$ for a quadratic function of
$\xi, \eta, \zeta$, and diff. coeff. in which the seventy-eight
coefficients are the space-averages of those in $\sigma$, the statical
energy of any considerable space may be estimated by the integral

$$\int s\,dv.$$

(This will appear most distinctly if we suppose the integration to
be first effected for a thin slice of the medium bounded by two
wave-planes.) The seventy-eight coefficients of this function $s$ are
determined solely by the nature of the medium and the period of
oscillation.

We may divide $s$ into three parts, of which the first ($s_{\prime}$)
contains the squares and products of $\xi, \eta, \zeta$, the second
($s_{\prime\prime}$) contains the products of $\xi, \eta, \zeta$ with
the differential coefficients, and the third ($s_{\prime\prime\prime}$)
contains the squares and products of the differential coefficients. It
is evident that the average statical energy of the whole medium per
unit of volume is the space-average of $s$, and that it will consist of
three parts, which are the space-averages of $s_{\prime}$,
$s_{\prime\prime}$, and $s_{\prime\prime\prime}$, respectively. These
parts we may call $S_{\prime}$, $S_{\prime\prime}$, and
$S_{\prime\prime\prime}$. Only the first of these was considered in
the preceding paper.

Now the considerations which justify us in neglecting, for an
approximate estimate, the terms of $s$ which contain the differential
coefficients of $\xi, \eta, \zeta$ with respect to the coordinates,
will apply with especial force to the terms which contain the squares
and products of these differential coefficients. Therefore, to carry
the approximation one step beyond that of the preceding paper, it will
only be necessary to take account of $s_{\prime}$ and
$s_{\prime\prime}$, and of $S_{\prime}$ and $S_{\prime\prime}$.

7. We may set

s, = 0^4- Ei/f-h (5) 

where, for a given medium and light of a given period, A, B, C, E, F, 
G are constant. 

Since the average values of 

sin® 27 r^, cos* 2 xj, sin 27r^cos 2 x j 



198 DOUBLE REFRACrriON AND CIRCULAR POLARIZATION 


are respectively and 0, and since at the time to be considered 

co8*2'7r- = 1, 

P ♦ 

it will appear from inspection of equations (1) that 

~ i 4" 4- Cyi^ 4- E/Siyi 4- Fyjai + Grui^i) 

4- i ( 4- 4“ Cyj^ 4“ E^2y2 + F y2a2 4- G (6) 

This is the first part of the statical energy of the whole medium 
per unit of volume. 

8. The second part of the statical energy of the whole medium
per unit of volume ($S_{\prime\prime}$) is the space-average of $s_{\prime\prime}$, which is a linear
function of the twenty-seven products of $\xi$, $\eta$, $\zeta$ with their differential
coefficients with respect to the coordinates. Now since

$$\xi\frac{d\xi}{dx} = \tfrac12\frac{d(\xi^2)}{dx},\qquad \eta\frac{d\eta}{dx} = \tfrac12\frac{d(\eta^2)}{dx},\qquad \text{etc.,}$$

the space-average of such products will be zero, and they will
contribute nothing to the value of $S_{\prime\prime}$. There will be nine of these
products, in which the same component of displacement appears
twice. The remaining eighteen products may be divided into pairs
according to the letters which they contain, as

$$\eta\frac{d\zeta}{dx} \quad\text{and}\quad \zeta\frac{d\eta}{dx}.$$

A linear function of the eighteen products may also be regarded
as a linear function of the sums and differences of the products in
such pairs. But since

$$\eta\frac{d\zeta}{dx} + \zeta\frac{d\eta}{dx} = \frac{d(\eta\zeta)}{dx},$$

the terms of $s_{\prime\prime}$ containing such sums will contribute nothing to the
value of $S_{\prime\prime}$. We have left a linear function of the nine differences

$$\eta\frac{d\zeta}{dx} - \zeta\frac{d\eta}{dx},\qquad \zeta\frac{d\xi}{dx} - \xi\frac{d\zeta}{dx},\qquad \xi\frac{d\eta}{dx} - \eta\frac{d\xi}{dx},\qquad \text{etc.,}$$

(the unwritten expressions being obtained by substituting in the
denominators $dy$ and $dz$ for $dx$), which constitutes the part of $s_{\prime\prime}$
that we have to consider. $S_{\prime\prime}$ is therefore a linear function of the
space-averages of these nine quantities. But by (3)

$$\eta\frac{d\zeta}{dx} - \zeta\frac{d\eta}{dx} = L\left(\eta\frac{d\zeta}{du} - \zeta\frac{d\eta}{du}\right),$$

and the space-average of this, at a moment of maximum displacement,
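By such reductions it appears that $lS_{\prime\prime}$ is a linear function of the
nine products of $L$, $M$, $N$ with

$$\beta_1\gamma_2 - \gamma_1\beta_2,\qquad \gamma_1\alpha_2 - \alpha_1\gamma_2,\qquad \alpha_1\beta_2 - \beta_1\alpha_2.$$

Now if we set

$$\Theta = L(\beta_1\gamma_2 - \gamma_1\beta_2) + M(\gamma_1\alpha_2 - \alpha_1\gamma_2) + N(\alpha_1\beta_2 - \beta_1\alpha_2), \tag{7}$$

we have by (4) and (2)

$$L\Theta = \beta_1\gamma_2 - \gamma_1\beta_2,\qquad M\Theta = \gamma_1\alpha_2 - \alpha_1\gamma_2,\qquad N\Theta = \alpha_1\beta_2 - \beta_1\alpha_2. \tag{8}$$

Therefore $lS_{\prime\prime}$ is a linear function of the nine products of $L$, $M$, $N$
with $L\Theta$, $M\Theta$, $N\Theta$. That is, $lS_{\prime\prime}$ is the product of $\Theta$ and a quadratic
function of $L$, $M$ and $N$. We may therefore write

$$S_{\prime\prime} = \frac{\Phi}{l}\,\Theta, \tag{9}$$

where $\Phi$ is a quadratic function of $L$, $M$ and $N$, dependent, however,
on the nature of the medium and the period of oscillation.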
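
The relations (8) say that the vector product of the two displacement amplitudes is directed along the wave-normal and has the magnitude $\Theta$. A minimal numerical sketch in Python follows. The particular values of $L$, $M$, $N$ and of the displacement components are arbitrary assumptions, not taken from the paper, and the helper names `cross`, `dot` and `project` are introduced here only for illustration.

```python
import math

def cross(p, q):
    """Vector product of two 3-component tuples."""
    return (p[1]*q[2] - p[2]*q[1],
            p[2]*q[0] - p[0]*q[2],
            p[0]*q[1] - p[1]*q[0])

def dot(p, q):
    return sum(a*b for a, b in zip(p, q))

# Assumed wave-normal, normalised so that L^2 + M^2 + N^2 = 1.
n = (1.0, 2.0, 2.0)
norm = math.sqrt(dot(n, n))
n = tuple(c/norm for c in n)            # (L, M, N)

def project(v):
    """Remove the part of v along the wave-normal, so that the
    transversality conditions are satisfied."""
    k = dot(v, n)
    return tuple(a - k*b for a, b in zip(v, n))

rho1 = project((0.3, -0.1, 0.4))        # (alpha1, beta1, gamma1), assumed
rho2 = project((-0.2, 0.5, 0.1))        # (alpha2, beta2, gamma2), assumed

# Components: beta1*gamma2 - gamma1*beta2, gamma1*alpha2 - alpha1*gamma2,
#             alpha1*beta2 - beta1*alpha2.
c = cross(rho1, rho2)
theta_cap = dot(n, c)                   # equation (7)

# Largest discrepancy between the two members of the three equations (8).
residual = max(abs(n[i]*theta_cap - c[i]) for i in range(3))
print(theta_cap, residual)
```

The residual should vanish to rounding error for any transverse pair of amplitudes, since a vector perpendicular to both lies along the wave-normal.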

9. It will be useful to consider more closely the geometrical
significance of the quantity $\Theta$. For this purpose it will be convenient
to have a definite understanding with respect to the relative position
of the coordinate axes.

We shall suppose that the axes of X, Y, and Z are related in the
same way as lines drawn to the right, forward and upward, so that
a rotation from X to Y appears clockwise to one looking in the
direction of Z.

Now if from any same point, as the origin of coordinates, we lay
off lines representing in direction and magnitude the displacements
in all the different wave-planes, we obtain an ellipse, which we may
call the displacement-ellipse.* Of this, one radius vector ($\rho_1$) will
have the components $\alpha_1$, $\beta_1$, $\gamma_1$, and another ($\rho_2$) the components
$\alpha_2$, $\beta_2$, $\gamma_2$. These will belong to conjugate diameters, each being
parallel to the tangent at the extremity of the other. The area of
the ellipse will therefore be equal to the parallelogram of which
$\rho_1$ and $\rho_2$ are two sides, multiplied by $\pi$. Now it is evident that
$\beta_1\gamma_2 - \gamma_1\beta_2$, $\gamma_1\alpha_2 - \alpha_1\gamma_2$, $\alpha_1\beta_2 - \beta_1\alpha_2$ are numerically equal to the
projections of this parallelogram on the planes of the coordinate axes,
and are each positive or negative according as a revolution from
$\rho_1$ to $\rho_2$ appears clockwise or counter-clockwise to one looking in
the direction of the proper coordinate axis. Hence, $\Theta$ will be
numerically equal to the parallelogram, that is, to the area of the
displacement-ellipse divided by $\pi$, and will be positive or negative
according as a revolution from $\rho_1$ to $\rho_2$ appears clockwise or
counter-clockwise to one looking in the direction of the wave-normal. Since
$\rho_1$ and $\rho_2$ are determined by displacements in planes one-quarter of
a wave-length distant from each other, and the plane to which the
latter relates lies on the side toward which the wave-normal is drawn,
it follows that $\Theta$ is positive or negative according as the combination
of displacements has the character of a right-handed or a left-handed
screw.

* This ellipse, which represents the simultaneous displacements in different parts of
the field, will also represent the successive displacements at any same point in the
corresponding system of progressive waves.
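
The statement that $\Theta$ is numerically the area of the displacement-ellipse divided by $\pi$ may be checked numerically. The sketch below, in Python, assumes for simplicity that the wave-normal coincides with the axis of Z, so that $L = M = 0$, $N = 1$; the two radius vectors are arbitrary assumed values, not data from the paper.

```python
import math

# Assumed example: wave-normal along Z, displacements in the XY plane.
rho1 = (0.8, 0.1)     # (alpha1, beta1)
rho2 = (-0.3, 0.5)    # (alpha2, beta2)

# With L = M = 0 and N = 1, Theta reduces to alpha1*beta2 - beta1*alpha2.
theta_cap = rho1[0]*rho2[1] - rho1[1]*rho2[0]

# Trace the ellipse r(t) = rho1*cos(t) + rho2*sin(t) as a closed polygon.
steps = 20000
pts = []
for i in range(steps):
    t = 2.0*math.pi*i/steps
    pts.append((rho1[0]*math.cos(t) + rho2[0]*math.sin(t),
                rho1[1]*math.cos(t) + rho2[1]*math.sin(t)))

# Shoelace formula: signed area of the polygon, whose sign records the
# sense in which the ellipse is described as t increases.
signed_area = 0.5*sum(
    pts[i][0]*pts[(i + 1) % steps][1] - pts[(i + 1) % steps][0]*pts[i][1]
    for i in range(steps))

print(signed_area, math.pi*theta_cap)
```

The signed area agrees with $\pi\Theta$ up to the discretisation error of the polygon, and changes sign with $\Theta$ if $\rho_1$ and $\rho_2$ are interchanged.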

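10. The kinetic energy of the medium, which is to be estimated
for an instant of no displacement, may be shown as in § 7 of the
former paper (page 185 of this volume) to consist of two parts, of
which one relates to the regular flux ($\dot\xi$, $\dot\eta$, $\dot\zeta$), and the other to the
irregular flux ($\dot\xi'$, $\dot\eta'$, $\dot\zeta'$). The first, in the notation of that paper, is
represented by

$$\tfrac12\int\left(\dot\xi\,\mathrm{Pot}\,\dot\xi + \dot\eta\,\mathrm{Pot}\,\dot\eta + \dot\zeta\,\mathrm{Pot}\,\dot\zeta\right)dv,$$

which reduces to

$$\frac{l^2}{2\pi}\int\left(\dot\xi^2 + \dot\eta^2 + \dot\zeta^2\right)dv.$$

By substitution of the values given by equations (1), we obtain for
the kinetic energy due to the regular flux in a unit of volume

$$T = \frac{\pi l^2}{p^2}\left(\alpha_1^2 + \beta_1^2 + \gamma_1^2 + \alpha_2^2 + \beta_2^2 + \gamma_2^2\right). \tag{10}$$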

11. The kinetic energy of the irregular part of the flux is
represented by the volume-integral

$$\tfrac12\int\left(\dot\xi'\,\mathrm{Pot}\,\dot\xi' + \dot\eta'\,\mathrm{Pot}\,\dot\eta' + \dot\zeta'\,\mathrm{Pot}\,\dot\zeta'\right)dv.$$

Now, since $\dot\xi'$, $\dot\eta'$, $\dot\zeta'$ are everywhere linear functions of $\dot\xi$, $\dot\eta$, $\dot\zeta$ and
diff. coeff. (see § 4), and since the integrations implied in the notation
Pot may be confined to a sphere of which the radius is small in
comparison with a wave-length,* and since within such a sphere $\dot\xi$, $\dot\eta$, $\dot\zeta$
and diff. coeff. are sufficiently determined (in a linear form) by the
values of the same twelve quantities at the center of the sphere, it
follows that $\mathrm{Pot}\,\dot\xi'$, $\mathrm{Pot}\,\dot\eta'$, $\mathrm{Pot}\,\dot\zeta'$ must be linear functions of the values
of $\dot\xi$, $\dot\eta$, $\dot\zeta$ and diff. coeff. at the point for which the potential is sought.
Hence,

$$\tfrac12\left(\dot\xi'\,\mathrm{Pot}\,\dot\xi' + \dot\eta'\,\mathrm{Pot}\,\dot\eta' + \dot\zeta'\,\mathrm{Pot}\,\dot\zeta'\right)$$

will be a quadratic function of $\dot\xi$, $\dot\eta$, $\dot\zeta$ and diff. coeff. But the
seventy-eight coefficients by which this function is expressed will
vary with the position of the point considered with respect to the
surrounding molecules.

* See § 9 of the former paper, on page 187 of this volume.
Yet, as in the case of the statical energy, we may substitute the
average values of these coefficients for the coefficients themselves in
the integral by which we obtain the energy of any considerable space.
The kinetic energy due to the irregular part of the flux is thus
reduced to a quadratic function of $\dot\xi$, $\dot\eta$, $\dot\zeta$ and diff. coeff. which has
constant coefficients for a given medium and light of a given period.

The function may be divided into three parts, of which the first
contains the squares and products of $\dot\xi$, $\dot\eta$, $\dot\zeta$, the second the products
of $\dot\xi$, $\dot\eta$, $\dot\zeta$ with their differential coefficients, and the third, which may
be neglected, the squares and products of the differential coefficients.

We may proceed with the reduction precisely as in the case of the
statical energy, except that the differentiations with respect to the
time will introduce the constant factor $4\pi^2/p^2$. This will give for
the first part of the kinetic energy of the irregular flux per unit
of volume

$$T'_{\prime} = \frac{2\pi^2}{p^2}\left(A'\alpha_1^2 + B'\beta_1^2 + C'\gamma_1^2 + E'\beta_1\gamma_1 + F'\gamma_1\alpha_1 + G'\alpha_1\beta_1\right)
 + \frac{2\pi^2}{p^2}\left(A'\alpha_2^2 + B'\beta_2^2 + C'\gamma_2^2 + E'\beta_2\gamma_2 + F'\gamma_2\alpha_2 + G'\alpha_2\beta_2\right), \tag{11}$$

and for the second part of the same

$$T'_{\prime\prime} = \frac{4\pi^2\Phi'}{p^2 l}\left[L(\beta_1\gamma_2 - \gamma_1\beta_2) + M(\gamma_1\alpha_2 - \alpha_1\gamma_2) + N(\alpha_1\beta_2 - \beta_1\alpha_2)\right], \tag{12}$$

where A', B', C', E', F', G' are constant, and $\Phi'$ a quadratic function of
L, M, and N, for a given medium and light of a given period.

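12. Equating the statical and kinetic energies, we have

$$S_{\prime} + S_{\prime\prime} = T + T'_{\prime} + T'_{\prime\prime},$$

that is, by equations (6), (9), (10), (11), and (12),

$$\begin{aligned}
&\tfrac12\left(A\alpha_1^2 + B\beta_1^2 + C\gamma_1^2 + E\beta_1\gamma_1 + F\gamma_1\alpha_1 + G\alpha_1\beta_1\right)\\
&\quad + \tfrac12\left(A\alpha_2^2 + B\beta_2^2 + C\gamma_2^2 + E\beta_2\gamma_2 + F\gamma_2\alpha_2 + G\alpha_2\beta_2\right)\\
&\quad + \frac{\Phi}{l}\left[L(\beta_1\gamma_2 - \gamma_1\beta_2) + M(\gamma_1\alpha_2 - \alpha_1\gamma_2) + N(\alpha_1\beta_2 - \beta_1\alpha_2)\right]\\
&= \frac{\pi l^2}{p^2}\left(\alpha_1^2 + \beta_1^2 + \gamma_1^2 + \alpha_2^2 + \beta_2^2 + \gamma_2^2\right)\\
&\quad + \frac{2\pi^2}{p^2}\left(A'\alpha_1^2 + B'\beta_1^2 + C'\gamma_1^2 + E'\beta_1\gamma_1 + F'\gamma_1\alpha_1 + G'\alpha_1\beta_1\right)\\
&\quad + \frac{2\pi^2}{p^2}\left(A'\alpha_2^2 + B'\beta_2^2 + C'\gamma_2^2 + E'\beta_2\gamma_2 + F'\gamma_2\alpha_2 + G'\alpha_2\beta_2\right)\\
&\quad + \frac{4\pi^2\Phi'}{p^2 l}\left[L(\beta_1\gamma_2 - \gamma_1\beta_2) + M(\gamma_1\alpha_2 - \alpha_1\gamma_2) + N(\alpha_1\beta_2 - \beta_1\alpha_2)\right].
\end{aligned} \tag{13}$$

If we set

$$a = \frac{A}{2\pi} - \frac{2\pi A'}{p^2},\qquad b = \frac{B}{2\pi} - \frac{2\pi B'}{p^2},\qquad \text{etc.,} \tag{14}$$

and

$$\phi = \frac{\Phi}{2\pi} - \frac{2\pi\Phi'}{p^2}, \tag{15}$$

the equation reduces to

$$\begin{aligned}
&a\alpha_1^2 + b\beta_1^2 + c\gamma_1^2 + e\beta_1\gamma_1 + f\gamma_1\alpha_1 + g\alpha_1\beta_1\\
&\quad + a\alpha_2^2 + b\beta_2^2 + c\gamma_2^2 + e\beta_2\gamma_2 + f\gamma_2\alpha_2 + g\alpha_2\beta_2\\
&\quad + \frac{2\phi}{l}\left[L(\beta_1\gamma_2 - \gamma_1\beta_2) + M(\gamma_1\alpha_2 - \alpha_1\gamma_2) + N(\alpha_1\beta_2 - \beta_1\alpha_2)\right]\\
&= \frac{l^2}{p^2}\left(\alpha_1^2 + \beta_1^2 + \gamma_1^2 + \alpha_2^2 + \beta_2^2 + \gamma_2^2\right),
\end{aligned} \tag{16}$$

where $a$, $b$, $c$, $e$, $f$, $g$ are constant, and $\phi$ a quadratic function of
L, M, N, for a given medium and light of a given period.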

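13. Now this equation, which expresses a relation between the
constants of the equations of wave-motion (1), will apply, with those
equations, not only to such vibrations as actually take place, but also
to such as we may imagine to take place under the influence of
constraints determining the type of vibration. The free or unconstrained
vibrations, with which alone we are concerned, are characterized
by this, that infinitesimal variations (by constraint) of the
type of vibration, that is, of the ratios of the quantities $\alpha_1$, $\beta_1$, $\gamma_1$,
$\alpha_2$, $\beta_2$, $\gamma_2$, will not affect the period by any quantity of the same
order of magnitude.* These variations must however be consistent
with equations (4), which require that

$$L\,d\alpha_1 + M\,d\beta_1 + N\,d\gamma_1 = 0,\qquad L\,d\alpha_2 + M\,d\beta_2 + N\,d\gamma_2 = 0. \tag{17}$$

Hence, to obtain the conditions which characterize free vibration,
we may differentiate equation (16) with respect to $\alpha_1$, $\beta_1$, $\gamma_1$, $\alpha_2$, $\beta_2$, $\gamma_2$,
regarding all other letters as constant, and give to $d\alpha_1$, $d\beta_1$, $d\gamma_1$,
$d\alpha_2$, $d\beta_2$, $d\gamma_2$, such values as are consistent with equations (17).
Now $d\alpha_1$, $d\beta_1$, $d\gamma_1$ are independent of $d\alpha_2$, $d\beta_2$, $d\gamma_2$, and for either
three variations, values proportional either to $\alpha_1$, $\beta_1$, $\gamma_1$, or to $\alpha_2$, $\beta_2$, $\gamma_2$,
are possible. If, then, we differentiate equation (16) with respect to
$\alpha_1$, $\beta_1$, $\gamma_1$, and substitute first $\alpha_1$, $\beta_1$, $\gamma_1$, and then $\alpha_2$, $\beta_2$, $\gamma_2$, for
$d\alpha_1$, $d\beta_1$, $d\gamma_1$, and also differentiate with respect to $\alpha_2$, $\beta_2$, $\gamma_2$, with
similar substitutions, we shall obtain all the independent equations
which this principle will yield.

If we differentiate with respect to $\alpha_1$, $\beta_1$, $\gamma_1$, and write $\alpha_1$, $\beta_1$, $\gamma_1$
for $d\alpha_1$, $d\beta_1$, $d\gamma_1$, we obtain

$$\begin{aligned}
&a\alpha_1^2 + b\beta_1^2 + c\gamma_1^2 + e\beta_1\gamma_1 + f\gamma_1\alpha_1 + g\alpha_1\beta_1\\
&\quad + \frac{\phi}{l}\left[L(\beta_1\gamma_2 - \gamma_1\beta_2) + M(\gamma_1\alpha_2 - \alpha_1\gamma_2) + N(\alpha_1\beta_2 - \beta_1\alpha_2)\right]
 = \frac{l^2}{p^2}\left(\alpha_1^2 + \beta_1^2 + \gamma_1^2\right).
\end{aligned} \tag{18}$$

If we differentiate with respect to $\alpha_1$, $\beta_1$, $\gamma_1$, and write $\alpha_2$, $\beta_2$, $\gamma_2$ for
$d\alpha_1$, $d\beta_1$, $d\gamma_1$, we obtain

$$\begin{aligned}
&2a\alpha_1\alpha_2 + 2b\beta_1\beta_2 + 2c\gamma_1\gamma_2 + e(\beta_1\gamma_2 + \gamma_1\beta_2) + f(\gamma_1\alpha_2 + \alpha_1\gamma_2)\\
&\quad + g(\alpha_1\beta_2 + \beta_1\alpha_2) = \frac{2l^2}{p^2}\left(\alpha_1\alpha_2 + \beta_1\beta_2 + \gamma_1\gamma_2\right).
\end{aligned} \tag{19}$$

If we differentiate with respect to $\alpha_2$, $\beta_2$, $\gamma_2$, and write $\alpha_2$, $\beta_2$, $\gamma_2$ for
$d\alpha_2$, $d\beta_2$, $d\gamma_2$, we obtain

$$\begin{aligned}
&a\alpha_2^2 + b\beta_2^2 + c\gamma_2^2 + e\beta_2\gamma_2 + f\gamma_2\alpha_2 + g\alpha_2\beta_2\\
&\quad + \frac{\phi}{l}\left[L(\beta_1\gamma_2 - \gamma_1\beta_2) + M(\gamma_1\alpha_2 - \alpha_1\gamma_2) + N(\alpha_1\beta_2 - \beta_1\alpha_2)\right]
 = \frac{l^2}{p^2}\left(\alpha_2^2 + \beta_2^2 + \gamma_2^2\right).
\end{aligned} \tag{20}$$

The equation derived by differentiating with respect to $\alpha_2$, $\beta_2$, $\gamma_2$,
and writing $\alpha_1$, $\beta_1$, $\gamma_1$ for $d\alpha_2$, $d\beta_2$, $d\gamma_2$, is identical with (19). We should
also observe that equations (18) and (20) by addition give equation
(16), which therefore will not need to be considered in addition to
the last three equations.

* Compare § 11 of the former paper, page 189 of this volume.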

14. The geometrical signification of our equations may now be
simplified by a suitable choice of the position of the origin of
coordinates, which is as yet wholly arbitrary.

We shall hereafter suppose that the origin is placed in a plane of
maximum or minimum displacement,* if such there are. In the case
of circular polarization, in which the displacements are everywhere
equal, its position is immaterial. The lines $\rho_1$ and $\rho_2$, of which $\alpha_1$, $\beta_1$,
$\gamma_1$ and $\alpha_2$, $\beta_2$, $\gamma_2$ are respectively the components, will now be the
semi-axes of the displacement-ellipse, and therefore at right angles.
(See § 9.) The case of circular polarization will not constitute any
exception. Hence,

$$\alpha_1\alpha_2 + \beta_1\beta_2 + \gamma_1\gamma_2 = 0, \tag{21}$$

and by § 9,

$$\Theta = L(\beta_1\gamma_2 - \gamma_1\beta_2) + M(\gamma_1\alpha_2 - \alpha_1\gamma_2) + N(\alpha_1\beta_2 - \beta_1\alpha_2) = \pm\rho_1\rho_2, \tag{22}$$

where we are to read $+$ or $-$ in the last member according as the
system of displacements has the character of a right-handed or a
left-handed screw.

* The reader will perceive that an earlier limitation of the position of the origin by a
supposition of this nature, involving a limitation of the values of $\alpha_1$, $\beta_1$, $\gamma_1$, $\alpha_2$, $\beta_2$, $\gamma_2$,
would have been embarrassing in the operations of the last paragraph.

15. Equation (19) is now reduced to the form

$$2a\alpha_1\alpha_2 + 2b\beta_1\beta_2 + 2c\gamma_1\gamma_2 + e(\beta_1\gamma_2 + \gamma_1\beta_2)
 + f(\gamma_1\alpha_2 + \alpha_1\gamma_2) + g(\alpha_1\beta_2 + \beta_1\alpha_2) = 0, \tag{23}$$

which has a very simple geometrical signification. If we consider
the ellipsoid

$$ax^2 + by^2 + cz^2 + eyz + fzx + gxy = 1, \tag{24}$$

and especially its central section by a plane parallel to the planes of
the wave-system which we are considering, it will easily appear that
the equation

$$2ax_1x_2 + 2by_1y_2 + 2cz_1z_2 + e(y_1z_2 + z_1y_2) + f(z_1x_2 + x_1z_2) + g(x_1y_2 + y_1x_2) = 0$$

will hold of any two points $x_1$, $y_1$, $z_1$ and $x_2$, $y_2$, $z_2$, which belong to
conjugate diameters of this central section. Therefore equation (23)
expresses that the displacements $\alpha_1$, $\beta_1$, $\gamma_1$ and $\alpha_2$, $\beta_2$, $\gamma_2$ are parallel
to conjugate diameters of the central section of the ellipsoid (24) by a
wave-plane. But since the displacements $\alpha_1$, $\beta_1$, $\gamma_1$ and $\alpha_2$, $\beta_2$, $\gamma_2$ are
also at right angles to each other, it follows that they are parallel
to the axes of the central section of the ellipsoid (24) by a wave-plane.
That is: The axes of the displacement-ellipse coincide in direction
with those of a central section of the ellipsoid (24) by a wave-plane.
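
The construction of this paragraph may be illustrated numerically: in the wave-plane we seek two directions at right angles which also satisfy (23). The following Python sketch does this by rotating an arbitrary pair of perpendicular directions in the wave-plane until the left member of (23) vanishes. The coefficients $a, \dots, g$ and the wave-normal are assumed values chosen only so that (24) is an ellipsoid; the helper names are hypothetical.

```python
import math

def dot(p, q):
    return sum(x*y for x, y in zip(p, q))

def cross(p, q):
    return (p[1]*q[2] - p[2]*q[1],
            p[2]*q[0] - p[0]*q[2],
            p[0]*q[1] - p[1]*q[0])

def unit(p):
    r = math.sqrt(dot(p, p))
    return tuple(x/r for x in p)

# Assumed coefficients of the ellipsoid (24) and assumed wave-normal.
a, b, c, e, f, g = 1.0, 1.3, 1.7, 0.2, -0.1, 0.15
n = unit((1.0, 1.0, 2.0))          # (L, M, N)

def bilinear(p, q):
    """Half of the left member of (23) for two displacements p and q;
    bilinear(p, p) is the left member of (24) evaluated at p."""
    return (a*p[0]*q[0] + b*p[1]*q[1] + c*p[2]*q[2]
            + 0.5*e*(p[1]*q[2] + p[2]*q[1])
            + 0.5*f*(p[2]*q[0] + p[0]*q[2])
            + 0.5*g*(p[0]*q[1] + p[1]*q[0]))

# Any two perpendicular unit vectors lying in the wave-plane.
u = unit(cross(n, (1.0, 0.0, 0.0)))
v = cross(n, u)

# Rotate (u, v) in the wave-plane by the angle that removes the cross term.
q11, q22, q12 = bilinear(u, u), bilinear(v, v), bilinear(u, v)
psi = 0.5*math.atan2(2.0*q12, q11 - q22)
d1 = tuple(math.cos(psi)*x + math.sin(psi)*y for x, y in zip(u, v))
d2 = tuple(-math.sin(psi)*x + math.cos(psi)*y for x, y in zip(u, v))

# Squared reciprocals of the semi-axes of the central section.
U1_sq = bilinear(d1, d1)
U2_sq = bilinear(d2, d2)
print(d1, d2, U1_sq, U2_sq)
```

The directions `d1` and `d2` are perpendicular to each other and to the wave-normal and make the left member of (23) vanish, so they are the axes of the central section; the values of the quadratic form along them are the squared reciprocals of the semi-axes.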

16. If we write $U_1$, $U_2$ for the reciprocals of the semi-axes of the
central section of the ellipsoid (24) by a wave-plane, $U_1$ being the
reciprocal of the one to which the displacement $\alpha_1$, $\beta_1$, $\gamma_1$ is parallel,
we have

$$a\alpha_1^2 + b\beta_1^2 + c\gamma_1^2 + e\beta_1\gamma_1 + f\gamma_1\alpha_1 + g\alpha_1\beta_1 = U_1^2\left(\alpha_1^2 + \beta_1^2 + \gamma_1^2\right), \tag{25}$$

as is at once evident if we substitute the coordinates of an extremity
of the axis for the proportional quantities $\alpha_1$, $\beta_1$, $\gamma_1$. So also

$$a\alpha_2^2 + b\beta_2^2 + c\gamma_2^2 + e\beta_2\gamma_2 + f\gamma_2\alpha_2 + g\alpha_2\beta_2 = U_2^2\left(\alpha_2^2 + \beta_2^2 + \gamma_2^2\right). \tag{26}$$

If we write V for the velocity of propagation of the system of
progressive waves corresponding to the system of stationary waves
which we have been considering, we shall have

$$V = \frac{l}{p}. \tag{27}$$

By equations (22), (25), and (26), together with (27), equations (18)
and (20) are reduced to the form

$$U_1^2\rho_1^2 \pm \frac{\phi}{l}\rho_1\rho_2 = V^2\rho_1^2,\qquad U_2^2\rho_2^2 \pm \frac{\phi}{l}\rho_1\rho_2 = V^2\rho_2^2, \tag{28}$$

where we are to read $+$ or $-$ according as the disturbance has the
character of a right-handed or a left-handed screw. In a progressive
system of waves, when the combination of displacements has the
character of a right-handed screw, the rotations will be such as appear
clockwise to the observer, who looks in the direction opposite to that
of the propagation of light. We shall call such a ray right-handed.

We may here observe that in case $\phi = 0$ the solution of these
equations is very simple. We have necessarily either $\rho_2 = 0$ and
$V^2 = U_1^2$, or $\rho_1 = 0$ and $V^2 = U_2^2$. In this case, the light is linearly
polarized, and the directions of oscillation and the velocities of
propagation are given by Fresnel's law. Experiment has shown that
this is the usual case. We wish, however, to investigate the case in
which $\phi$ does not vanish. Since the term containing $\phi$ arises from
the consideration of those quantities which it was allowable to neglect
in the first approximation, we may assume that $\phi$ is always very
small in comparison with $V^2$, $U_1^2$, or $U_2^2$.
17. Equations (28) may be written, since $l = pV$ by (27),

$$V^2 - U_1^2 = \pm\frac{\phi}{pV}\,\frac{\rho_2}{\rho_1},\qquad V^2 - U_2^2 = \pm\frac{\phi}{pV}\,\frac{\rho_1}{\rho_2}. \tag{29}$$

By multiplication we obtain

$$V^2\left(V^2 - U_1^2\right)\left(V^2 - U_2^2\right) = \frac{\phi^2}{p^2}. \tag{30}$$

Since $\phi$ is a very small quantity, it is evident from inspection of this
equation that it will admit three values of $V^2$, of which one will be a
very little greater than the greater of the two quantities $U_1^2$ and $U_2^2$,
another will be a very little less than the less of the same two
quantities, and the third will be a very small quantity. It is evident that
the values of $V^2$ with which we have to do are those which differ but
little from $U_1^2$ and $U_2^2$.*

For the numerical computation of V, when $U_1$, $U_2$, and $\phi$ are
known numerically, we may divide the equation by $V^2$, and then
solve it as if the second member were known. This will give

$$V^2 = \tfrac12\left(U_1^2 + U_2^2\right) \pm \sqrt{\tfrac14\left(U_1^2 - U_2^2\right)^2 + \frac{\phi^2}{p^2V^2}}. \tag{31}$$

By substituting $U_1^2$ and $U_2^2$ for $V^2$ in the second member, we may obtain
a close approximation to the two values of $V^2$. Each of the values
obtained may be improved by substitution of that value for $V^2$ in the
second member of the equation.
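
The successive substitution just described is readily carried out by machine. The Python sketch below assumes arbitrary illustrative values of $U_1^2$, $U_2^2$ and of the small quantity $\phi/p$ (they are not data from the paper), iterates equation (31) from the first approximations $U_1^2$ and $U_2^2$, and then checks the result against the cubic (30). The function names are hypothetical.

```python
import math

# Assumed illustrative values, in arbitrary consistent units.
U1_sq, U2_sq = 1.10, 1.00      # U1^2 and U2^2, with U1^2 the greater
phi_over_p = 1.0e-3            # phi/p, small compared with U1^2 and U2^2

def refine(v_sq, sign, rounds=20):
    """Repeated substitution in the second member of (31)."""
    for _ in range(rounds):
        radicand = 0.25*(U1_sq - U2_sq)**2 + phi_over_p**2/v_sq
        v_sq = 0.5*(U1_sq + U2_sq) + sign*math.sqrt(radicand)
    return v_sq

def cubic_residual(v_sq):
    """First member of (30) minus its second member."""
    return v_sq*(v_sq - U1_sq)*(v_sq - U2_sq) - phi_over_p**2

V_sq_plus = refine(U1_sq, +1)    # root a little greater than U1^2
V_sq_minus = refine(U2_sq, -1)   # root a little less than U2^2

# Ratio rho2^2/rho1^2 of the squared axes for the greater root,
# obtained by dividing the first of equations (29) by the second.
axis_ratio_sq = (V_sq_plus - U1_sq)/(V_sq_plus - U2_sq)

print(V_sq_plus, V_sq_minus, axis_ratio_sq)
```

With values of this kind the substitution converges in a very few rounds, because the second member of (31) varies only slowly with $V^2$ when $\phi$ is small.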

For either value of $V^2$, we may easily find the ratio of $\rho_1$ to $\rho_2$,
that is, the ratio of the axes of the displacement-ellipse, from one of
equations (29), or from the equation

$$\frac{\rho_2^2}{\rho_1^2} = \frac{V^2 - U_1^2}{V^2 - U_2^2}, \tag{32}$$

obtained by combining the two.

In equations (29), we are to read $+$ or $-$ in the second members,
according as the ray is right-handed or left-handed. (See § 16.) It
follows that if the value of $\phi$ is positive, the greater velocity will
belong to a right-handed ray, and the smaller to a left-handed, but if
the value of $\phi$ is negative, the opposite is the case. Except when
$\phi = 0$, and the polarization is linear, there will be one right-handed
and one left-handed ray for any given wave-normal and period.

* We should not attribute any physical significance to the third value of $V^2$. For
this value would imply a wave-length very small in comparison with the length of
ordinary waves of light, and with respect to which our fundamental assumption that
the wave-length is very great in comparison with the distances of contiguous molecules
would be entirely false. Our analysis, therefore, furnishes no reason for supposing that
any such velocities are possible for the propagation of electrical disturbances.

18. When $U_1 = U_2$, equations (29) give

$$V^2 = U^2 \pm \frac{\phi}{pV},$$

where U represents the common value of $U_1$ and $U_2$, and they require
at the same time that $\rho_1 = \rho_2$. The polarization is therefore circular.
The converse is also evident from equations
(29), viz., that a ray can be circularly polarized only when the
direction of its wave-normal is such that $U_1 = U_2$. Such a direction,
which is determined by a circular section of the ellipsoid (24) precisely
as an optic axis of a crystal which conforms to Fresnel's law of double
refraction, may be called an optic axis, although its physical
properties are not the same as in the more ordinary case.* If we write
$V_R$ and $V_L$, respectively, for the wave-velocities of the right-handed
and left-handed rays, we have

$$V_R^2 = U^2 + \frac{\phi}{pV_R},\qquad V_L^2 = U^2 - \frac{\phi}{pV_L}, \tag{33}$$

whence

$$V_R^2 - V_L^2 = \frac{\phi}{p}\left(\frac{1}{V_R} + \frac{1}{V_L}\right),$$

and

$$V_R - V_L = \frac{\phi}{pV_RV_L}. \tag{34}$$

The phenomenon best observed with respect to an optic axis is the
rotation of the plane of linearly polarized light. If we denote by $\theta$
the amount of this rotation per unit of the distance traversed by the
wave-plane, regarding it as positive when it appears clockwise to the
* Our experimental knowledge of circularly or elliptically polarizing media is confined
to such as are optically either isotropic or uniaxial. The general theory of such media,
embracing the case of two optic axes, has however been discussed by Professor von Lang
("Theorie der Circularpolarization," Sitz.-Ber. Wiener Akad., vol. lxxv, p. 719). The
general results of the present paper, although derived from physical hypotheses of an
entirely different nature, are quite similar to those of the memoir cited. They would

become identical, the writer believes, by the substitution of a constant for ^ or ^ 
in the equations of this paper. (See especially equations (18), (20), (28),) 

That a complete discussion of the subject on any theory must include the case of
biaxial media having the property of circular or elliptical polarization, is evident from
the consideration that it must at least be possible to produce examples of such media
artificially. An isotropic or uniaxial crystal may be made biaxial by pressure. If it
has the property of circular and elliptic polarization, that property cannot be wholly
destroyed by the application of small pressures.
observer, who looks in the direction opposite to that of the propagation
of the light,* we have

$$\theta = \frac{\pi}{p}\left(\frac{1}{V_L} - \frac{1}{V_R}\right). \tag{35}$$

By the preceding equation, this reduces to

$$\theta = \frac{\pi\phi}{p^2V_R^2V_L^2}. \tag{36}$$

Without any appreciable error, we may substitute $U^4$ for $V_R^2V_L^2$,
which will give†

$$\theta = \frac{\pi\phi}{p^2U^4}. \tag{37}$$
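
The closeness of the three expressions for $\theta$ can be seen in a numerical example. The Python sketch below assumes illustrative values of U, p and $\phi$ in arbitrary consistent units (not data from the paper), solves the two equations (33) by repeated substitution, and then evaluates (35), (36) and (37). The function name `solve_velocity` is hypothetical.

```python
import math

# Assumed illustrative values, in arbitrary consistent units.
U = 1.0          # common value of U1 and U2 along the optic axis
p = 1.0          # period
phi = 1.0e-4     # small and positive, so the right-handed ray is the faster

def solve_velocity(sign, rounds=50):
    """Repeated substitution in V^2 = U^2 + sign*phi/(p*V), equations (33)."""
    v = U
    for _ in range(rounds):
        v = math.sqrt(U*U + sign*phi/(p*v))
    return v

V_R = solve_velocity(+1)   # right-handed ray
V_L = solve_velocity(-1)   # left-handed ray

theta_35 = (math.pi/p)*(1.0/V_L - 1.0/V_R)      # equation (35)
theta_36 = math.pi*phi/(p*p*V_R**2*V_L**2)      # equation (36)
theta_37 = math.pi*phi/(p*p*U**4)               # equation (37)

print(V_R, V_L, theta_35, theta_36, theta_37)
```

Equations (35) and (36) agree to rounding error, as they should, being connected by the exact relation (34); the value from (37) differs from them only by a fraction of the order of the square of $V_R - V_L$.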


19. Since these equations involve unknown functions of the period,
they will not serve for an exact determination of the relation between
$\theta$ and the period. For a rough approximation, however, we may
assume that the manner in which the general displacement in any
small part of the medium distributes itself among the molecules and
intermolecular spaces is independent of the period, being determined
entirely by the values of $\xi$, $\eta$, $\zeta$ and their differential coefficients with
respect to the coordinates.‡ For a fixed direction of the wave-normal,
$\Phi$ and $\Phi'$ will then be constant. Now equations (15) and (36) give

$$\theta = \frac{\Phi}{2p^2V_R^2V_L^2} - \frac{2\pi^2\Phi'}{p^4V_R^2V_L^2}. \tag{38}$$

To express this result in terms of the quantities directly observed, we
may use the equations

$$p = \frac{\lambda}{k},\qquad V_R = \frac{k}{n_R},\qquad V_L = \frac{k}{n_L},\qquad U = \frac{k}{n},$$

where $k$ denotes the velocity of light in vacuo, $\lambda$ the wave-length
in vacuo of the light employed, $n_R$, $n_L$ the absolute indices of
refraction of the two rays, and $n$ the index for the optic axis as derived
from the ellipsoid (24) by Fresnel's law. We thus obtain

$$\frac{\theta}{n_R^2n_L^2} = \frac{\Phi}{2k^2\lambda^2} - \frac{2\pi^2\Phi'}{\lambda^4}. \tag{39}$$

* When the rotation of the plane of polarization appears clockwise to the observer,
it has the character of a left-handed screw. But the circularly polarized ray to which
$V_R$ relates, the rotation of which also appears clockwise to the observer, has the
character of a right-handed screw.

† The degree of accuracy of this substitution may be shown as follows. By (33)

$$V_R\left(V_R^2 - U^2\right) = V_L\left(U^2 - V_L^2\right),$$

whence

$$V_R^3 + V_L^3 = (V_R + V_L)\,U^2,$$

$$V_R^2 - V_RV_L + V_L^2 = U^2,$$

$$V_RV_L = U^2 - (V_R - V_L)^2.$$

‡ Compare § 12 of the former paper, on page 189 of this volume.
In the case of uniaxial crystals, the direction of the optic axis is
fixed. We may therefore write

$$\frac{\theta}{n_R^2n_L^2} = \frac{K}{\lambda^2} + \frac{K'}{\lambda^4}, \tag{40}$$

regarding K and K' as constants. If we had used equation (37), we
should have had the factor $n^4$ instead of $n_R^2n_L^2$. Since this factor
varies but slowly with $\lambda$, it may be neglected, if its omission is
compensated in the values of K and K'. The formula being only
approximative, such a simplification will not necessarily render it
less accurate.
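
A formula of the form (40) is linear in the two constants, so that K and K' may be found from a series of observed rotations by the method of least squares. The Python sketch below illustrates the computation only: the wave-lengths and the "true" constants are assumed, and the values of $\theta/(n_R^2n_L^2)$ are generated from (40) itself, so that nothing here represents a measurement.

```python
# Synthetic illustration only: no measured rotations are used.
K_true, Kp_true = 7.0, 0.15                      # assumed constants of (40)
wavelengths = [0.40, 0.45, 0.50, 0.55, 0.60, 0.65, 0.70]   # assumed, arbitrary unit

# x = 1/lambda^2; y stands for theta/(n_R^2 n_L^2), generated from (40).
x = [1.0/(lam*lam) for lam in wavelengths]
y = [K_true*xv + Kp_true*xv*xv for xv in x]

# Normal equations of the least-squares fit y = K*x + K'*x^2:
#   K*sum(x^2) + K'*sum(x^3) = sum(x*y)
#   K*sum(x^3) + K'*sum(x^4) = sum(x^2*y)
s2 = sum(xv**2 for xv in x)
s3 = sum(xv**3 for xv in x)
s4 = sum(xv**4 for xv in x)
t1 = sum(xv*yv for xv, yv in zip(x, y))
t2 = sum(xv*xv*yv for xv, yv in zip(x, y))

det = s2*s4 - s3*s3
K_fit = (t1*s4 - t2*s3)/det
Kp_fit = (s2*t2 - s3*t1)/det

print(K_fit, Kp_fit)
```

With exact synthetic values the fit returns the assumed constants; with real observations the residuals of the fit would show how well the two-constant formula represents the dispersion of the rotation.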

20. But without any such assumption as that contained in the
last paragraph, we may easily obtain formulae for the experimental
determination of $\Phi$ and $\Phi'$ for the optic axis of a uniaxial crystal.
Considerations analogous to those of § 13 of the former paper
(page 190 of this volume), show that in differentiating equation (39)
we may regard $\Phi$ and $\Phi'$ as constant, although they may actually
vary with $\lambda$. This equation may be written

$$\frac{\theta\lambda^2}{n_R^2n_L^2} = \frac{\Phi}{2k^2} - \frac{2\pi^2\Phi'}{\lambda^2}. \tag{41}$$

Therefore,

$$\frac{d}{d\lambda}\left(\frac{\theta\lambda^2}{n_R^2n_L^2}\right) = \frac{4\pi^2\Phi'}{\lambda^3}. \tag{42}$$

When $\Phi'$ has been determined by this equation, $\Phi$ may be found
from the preceding.

21. If we wish to represent $\phi$ geometrically, like $U_1^2$ and $U_2^2$, we
may construct the surfaces

$$Ax^2 + By^2 + Cz^2 + Eyz + Fzx + Gxy = \pm 1, \tag{43}$$

the coefficients A, B, etc., being the same by which $\phi$ is expressed in
terms of $L^2$, $M^2$, etc. The numerical value of $\phi$, for any direction
of the wave-normal, will thus be represented by the square of the
reciprocal of the radius vector of the surface drawn in the same
direction. The positive or negative character of $\phi$ must be separately
indicated. There are here two cases to be distinguished. If the
sign of $\phi$ is the same in all directions, the surface will be an
ellipsoid, and we have only to know whether all the values of $\phi$ are
to be taken positively or all negatively. But if $\phi$ is positive for some
directions and negative for others, the surface will consist of two
conjugate hyperboloids, to one of which the positive, and to the other
the negative values belong.
22. The manner in which the ellipsoid (24) may be partially
determined by the relations of symmetry which the medium may
possess, has been sufficiently discussed in the former paper.

With respect to the quantity $\phi$, and the surfaces which determine
it, the following principle is of fundamental importance. If one body
is identical in its internal structure with the image by reflection of
another, the values of $\phi$ in corresponding lines in the two bodies will
be numerically equal but have opposite signs.*

It follows that if a body is identical in internal structure with its
own image by reflection, the value of $\phi$ (if not zero for all directions)
must be positive for some directions and negative for others.
Moreover, the above described surface by which $\phi$ is represented must
consist of two conjugate hyperboloids, of which one is identical in
form with the image by reflection of the other. This requires that
the hyperboloids shall be right cylinders with conjugate rectangular
hyperbolas for bases. A crystal characterized by such properties will
belong to the tetragonal system. Since $\phi = 0$ for the optic axis, it
would be difficult to distinguish a case of this kind from an ordinary
uniaxial crystal, unless the ellipsoid (24) should approach very closely
to a sphere.†

It is only in the very limited case described in the last paragraph
that a medium which is identical in its internal structure with its
image by reflection can have the property of circular or elliptic
polarization. To media which are unlike their images by reflection,
and have the property of circular polarization, we may apply the
following general principles.

If the medium has any axis of symmetry, the ellipsoid or
hyperboloids which represent the values of $\phi$ will have an axis in the same
direction. If the medium after a revolution of less than 180° about
any axis is equivalent to the medium in its first position, the ellipsoid
or hyperboloids will have an axis of revolution in that direction.

23. The laws of the propagation of light in plane waves, which


* The necessity of the opposite signs will perhaps appear most readily from the
consideration that the direction of rotation of the plane of polarization must be opposite
in the two bodies.

† There is no difficulty in conceiving of the constitution of a body which would have
the properties described above. Thus, we may imagine a body with molecules of a
spiral form, of which one-half are right-handed and one-half left-handed, and we may
suppose that the motion of electricity is opposed by a less resistance within them than
without. If the axes of the right-handed molecules are parallel to the axis of X, and
those of the left-handed molecules to the axis of Y, their effects would counterbalance
one another when the wave-normal is parallel to the axis of Z. But when the
wave-normal (of a beam of linearly polarized light) is parallel to the axis of X, the left-handed
molecules would produce a left-handed (negative) rotation of the plane of polarization,
the right-handed molecules having no effect; and when the wave-normal is parallel to
the axis of Y, the reverse would be the case.
have thus been derived from the single hypothesis that the disturbance
by which light is transmitted consists of solenoidal electrical
fluxes, and which apply to light of different colors and to the most
general case of perfectly transparent and sensibly homogeneous media
not subject to magnetic action,* are essentially those which are
generally received as embodying the results of experiment. In no
particular, so far as the writer is aware, do they conflict with the
results of experiment, or require the aid of auxiliary and forced
hypotheses to bring them into harmony therewith.

In this respect, the electromagnetic theory of light stands in marked
contrast with that theory in which the properties of an elastic solid
are attributed to the ether, a contrast which was very distinct in
Maxwell's derivation of Fresnel's laws from electrical principles, but
becomes more striking as we follow the subject farther into its details,
and take account of the want of absolute homogeneity in the medium,
so as to embrace the phenomena of the dispersion of colors and circular
and elliptical polarization.


* The rotation of the plane of polarization which is produced by magnetic action has
been discussed by Maxwell (Treatise on Electricity and Magnetism, vol. ii, chap. xxi),
and by Rowland (Amer. Journ. Math., vol. iii, p. 107).

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ON THE GENERAL EQUATIONS OF MONOCHROMATIC LIGHT 
IN MEDIA OF EVERY DEGREE OF TRANSPARENCY. 

[American Journal of Science, ser. 3, vol. xxv, pp. 107-118, February, 1883.]

1. The last April and June numbers of this Journal* contain an 
investigation of the velocity of plane waves of light, in which they 
are regarded as consisting of solenoidal electrical fluxes in an
indefinitely extended medium of uniform and very fine-grained structure.
It was also supposed that the medium was perfectly transparent, 
although without discussion of the physical properties on which 
transparency depends, and that the electrical motions were not 
complicated by any distinctively magnetic phenomena. 

In the present paper† the subject will be treated with more
generality, so as to obtain the general equations of monochromatic
light for media of every degree of transparency, whether sensibly
homogeneous or otherwise, which have a very fine-grained molecular
structure as measured by a wave-length of light. There will be no
restriction with respect to magnetic influence, except that an
oscillating magnetization of the medium will be excluded.‡

In order to conform as much as possible to the ordinary view of 

* See pages 182-194 and 195-210 of this volume.

† This paper contains, with some additional developments, the substance of a
communication to the National Academy of Sciences in November, 1882.

‡ Where a body capable of magnetization is subjected to the influence of light (as
when light is reflected from the surface of iron), there are two simple hypotheses which
present themselves with respect to the magnetic state of the body. One is that the
magnetic forces due to the light are not of sufficient duration to allow the molecular
changes which constitute magnetization to take place to any sensible extent. The other
is that the magnetization has a constant ratio to the magnetic force without regard to
its duration. We might easily make a more general hypothesis which would embrace
both of those mentioned as extreme cases, and which would be irreproachable from a
theoretical stand-point; but it would complicate our equations to a degree which would
not be compensated by their greater generality, since no phenomena depending on such
magnetization have been observed, so far as the writer is aware, or are likely to be,
except in a very limited class of cases.

For the purposes of this paper, therefore, it has seemed better to exclude media
capable of magnetization, except so far as the first mentioned hypothesis may be
applicable. But it does not appear that this requires us to exclude cases in which the medium
is subject to the influence of a permanent magnetic force, such as produces the
phenomenon of the magnetic rotation of the plane of polarization.

electrical phenomena,* we shall not introduce at first the hypothesis
of Maxwell that electrical fluxes are solenoidal.† Our results, however,
will be such as to require us to admit the substantial truth of this
hypothesis, if we regard the processes involved in the transmission of
light as electrical.

With regard to the undetermined questions of electrodynamic
induction, we shall adopt provisionally that hypothesis which appears the
most simple, yet proceed in such a manner that it will be evident exactly
how our results must be altered, if we prefer any other hypothesis.

Electrical quantities will be treated as measured in electromagnetic 
units. 

2. We must distinguish, as before, between the actual electrical 
displacements, which are too complicated to follow in detail with 
analysis, and which in their minutiae elude experimental demonstration, 
and the displacements as averaged for spaces which are large enough 
to smooth out their minor irregularities, but not so large as to 
obliterate to any sensible extent those more regular features of the electrical 
motion, which form the subject of optical experiment. These spaces 
must therefore be large as measured by the least distances between 
molecules, but small as measured by a wave-length of light. We 
shall also have occasion to consider similar averages for other 
quantities, as electromotive force, the electrostatic potential, etc. It will 
be convenient to suppose that the space for which the average is 
taken is the same in all parts of the field,‡ say a sphere of uniform 
radius having its center at the point considered. 
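
The averaging just described can be illustrated numerically. What follows is only a one-dimensional sketch under stated assumptions, not anything taken from the paper: the "actual" displacement is modelled as a smooth wave plus a fine-grained ripple, the sphere is replaced by an interval of half-width `radius`, and every name and numerical value is hypothetical.

```python
import math

def actual_displacement(x, wavelength, grain, ripple):
    """Smooth wave plus a fine-grained irregularity (illustrative model only)."""
    smooth = math.sin(2.0 * math.pi * x / wavelength)
    fine = ripple * math.sin(2.0 * math.pi * x / grain)
    return smooth + fine

def space_average(f, x, radius, samples=4000):
    """Midpoint-rule average of f over the interval [x - radius, x + radius]."""
    step = 2.0 * radius / samples
    total = 0.0
    for k in range(samples):
        total += f(x - radius + (k + 0.5) * step)
    return total / samples

wavelength = 1.0   # stands in for a wave-length of light
grain = 0.001      # stands in for the least distance between molecules
radius = 0.0103    # chosen so that grain << radius << wavelength
x0 = 0.2003        # point at which the average is taken

f = lambda x: actual_displacement(x, wavelength, grain, ripple=0.5)
averaged = space_average(f, x0, radius)
regular_part = math.sin(2.0 * math.pi * x0 / wavelength)
```

With these values the averaged quantity stays close to the regular part of the motion, while the unaveraged value at the same point is dominated by the fine-grained ripple.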

Whatever may be the quantities considered, such averages will be 
represented by the notation 

$$[\quad]_{\mathrm{Ave}}.$$

* It has, perhaps, retarded the acceptance of the electromagnetic theory of light that 
it was presented in connection with a theory of electrical action, which is probably more 
difficult to prove or disprove, and certainly presents more difficulties of comprehension, 
than the connection of optical and electrical phenomena, and which, as resting largely 
on a priori considerations, must naturally appear very differently to different minds. 
Moreover, the mathematical method by which the subject was treated, while it will 
remain a striking monument of its author’s originality of thought, and profoundly 
modify the development of mathematical physics, must nevertheless, by its wide 
departure from ordinary methods, have tended to repel such as might not make it a matter 
of serious study. 

† A flux is said to be solenoidal when it satisfies the conditions which characterize 
the motion of an incompressible fluid; in other words, if $u$, $v$, $w$ are the rectangular 
components of the flux, when 

$$\frac{du}{dx} + \frac{dv}{dy} + \frac{dw}{dz} = 0,$$

and the normal component of the flux is the same on both sides of any surfaces of 
discontinuity which may exist. 

‡ This is rather to fix our ideas, than on account of any mathematical necessity. For 
the space for which the average is taken may in general be considerably varied without 
sensibly affecting the value of the average. 
If, then, $\xi$, $\eta$, $\zeta$ denote the components of the actual displacement at 
the point considered, 

$$[\xi]_{\mathrm{Ave}},\quad [\eta]_{\mathrm{Ave}},\quad [\zeta]_{\mathrm{Ave}}$$

will represent the average values of these components in the small 
sphere about that point. These average values we shall treat as 
functions of the coordinates of the center of the sphere and of the 
time, and may call them, for brevity, the average values of $\xi$, $\eta$, $\zeta$. 
But however they may be designated, it is essential to remember that 
it is a space-average for a certain very small space, and never a time- 
average, that is intended. 

The object of this paper will be accomplished when we have 
expressed (explicitly or implicitly) the relations which subsist 
between the values of $[\xi]_{\mathrm{Ave}}$, $[\eta]_{\mathrm{Ave}}$, $[\zeta]_{\mathrm{Ave}}$ at different times and in 
different parts of the field,—in other words, when we have found 
the conditions which these quantities must satisfy as functions of 
the time and the coordinates. 

3. Let us suppose that luminous vibrations of any one period * are 
somewhere excited, and that the disturbance is propagated through 
the medium. The motions which are excited in any part of the 
medium, and the forces by which they are kept up, will be expressed 
by harmonic functions of the time, having the same period,† as may 
be proved by the single principle of the superposition of motions 
quite independently of any theory of the constitution of the medium, 
or of the nature of the motions, as electrical or otherwise. This is 
equally true of the actual motions, and of the averages which we are 
to consider. We may therefore set 

$$[\xi]_{\mathrm{Ave}} = \alpha_1 \cos\frac{2\pi}{p}t + \alpha_2 \sin\frac{2\pi}{p}t, \quad \text{etc.}, \tag{1}$$

* There is no real loss of generality in making the light monochromatic, since in every 
case it may be divided into parts, which are separately propagated, and each of which 
is monochromatic to any required degree of approximation. 

† It is of course possible that the expressions for the forces and displacements should 
have constant terms. But these will disappear, if the displacements are measured from 
the state of equilibrium about which the system vibrates, and we leave out of account 
in measuring the forces (and the electrostatic potential) that which would belong to the 
system in the state of equilibrium. To prevent misapprehension, it should be added 
that the term electrical displacement is not used in the restricted sense of dielectric 
displacement or polarization. The variation of the electrical displacement, as the term 
is used in this paper, constitutes what Maxwell calls the total motion of electricity or 
true current, and what he divides into two parts, which he distinguishes as the current 
of conduction and the variation of the electrical displacement. Such a division of 
the total motion of electricity is not necessary for the purposes of this paper, and 
the term displacement is used with reference to the total motion of electricity in a 
manner entirely analogous to that in which the term is ordinarily used in the theory of 
wave-motion. 
where $t$ denotes the time, $p$ the period, and $\alpha_1$, $\alpha_2$, functions of the 
coordinates. It follows that 

$$[\ddot{\xi}]_{\mathrm{Ave}} = -\frac{4\pi^2}{p^2}[\xi]_{\mathrm{Ave}}, \quad \text{etc.} \tag{2}$$
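
The step from (1) to (2) can be checked numerically. The lines below are a minimal sketch, assuming arbitrary illustrative values for the period and the two coefficients; the function names are hypothetical and the second derivative is estimated by a central finite difference rather than computed exactly.

```python
import math

def avg_displacement(t, p, a1, a2):
    """Harmonic average displacement of the form given in equation (1)."""
    w = 2.0 * math.pi / p
    return a1 * math.cos(w * t) + a2 * math.sin(w * t)

def second_derivative(f, t, h=1e-4):
    """Central finite-difference estimate of the second time-derivative."""
    return (f(t + h) - 2.0 * f(t) + f(t - h)) / (h * h)

p, a1, a2 = 2.0, 0.7, -0.3   # illustrative values only
t = 0.37

xi = lambda s: avg_displacement(s, p, a1, a2)
lhs = second_derivative(xi, t)                     # estimate of the acceleration
rhs = -(4.0 * math.pi ** 2 / p ** 2) * xi(t)       # right-hand side of equation (2)
```

The two sides agree up to the error of the finite-difference estimate.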

4. Now, on the electrical theory, these motions are excited by 
electrical forces, which are of two kinds, distinguished as electrostatic 
and electrodynamic. The electrostatic force is determined by 
the electrostatic potential. If we write $q$ for the actual value of the 
potential, and $[q]_{\mathrm{Ave}}$ for its value as averaged in the manner specified 
above, the components of the actual electrostatic force will be 

$$-\frac{dq}{dx},\quad -\frac{dq}{dy},\quad -\frac{dq}{dz},$$

and for the average values of these components in the small spaces 
described above we may write 

$$-\frac{d[q]_{\mathrm{Ave}}}{dx},\quad -\frac{d[q]_{\mathrm{Ave}}}{dy},\quad -\frac{d[q]_{\mathrm{Ave}}}{dz},$$

for it will make no difference whether we take the average before 
or after differentiation. 

5. The electrodynamic force is determined by the acceleration of 
electrical flux in all parts of the field, but physicists are not entirely 
agreed in regard to the laws by which it is determined. This 
difference of opinion is however of less importance, since it will not 
affect the result if electrical fluxes are always solenoidal. According 
to the most simple law, the components of the force are given by 
the volume-integrals 

$$-\iiint \frac{\ddot{\xi}}{r}\,dv,\quad -\iiint \frac{\ddot{\eta}}{r}\,dv,\quad -\iiint \frac{\ddot{\zeta}}{r}\,dv,$$

where $dv$ represents an element of volume, and $r$ the distance of this 
element from the point for which the value of the electromotive 
force is to be determined. In other words, the components of the 
force at any point are determined from the components of acceleration 
in all parts of the field by the same process by which (in the theories 
of gravitation, etc.) the value of the potential at any point is 
determined from the density of matter in all parts of space, except that 
the sign is to be reversed. Adopting this law, provisionally at least, 
we may express it by saying that the components of electrodynamic 
force are equal to the potentials taken negatively of the components 
of acceleration of electrical flux. And we may write, for brevity, 

$$-\operatorname{Pot}\ddot{\xi},\quad -\operatorname{Pot}\ddot{\eta},\quad -\operatorname{Pot}\ddot{\zeta}$$

for the components of force, using the symbol Pot to denote the 
operation by which the potential of a mass is derived from its 
density. For the average values of these components in the small 
spaces defined above, we may write 

$$-\operatorname{Pot}[\ddot{\xi}]_{\mathrm{Ave}},\quad -\operatorname{Pot}[\ddot{\eta}]_{\mathrm{Ave}},\quad -\operatorname{Pot}[\ddot{\zeta}]_{\mathrm{Ave}},$$


since it will make no difference whether we take the average before 
or after the operation of taking the potential. 

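6. If we write $X$, $Y$, $Z$ for the components of the total electromotive 
force (electrostatic and electrodynamic), we have 

$$[X]_{\mathrm{Ave}} = -\operatorname{Pot}[\ddot{\xi}]_{\mathrm{Ave}} - \frac{d[q]_{\mathrm{Ave}}}{dx}, \quad \text{etc.}, \tag{3}$$

or by (2) 

$$[X]_{\mathrm{Ave}} = \frac{4\pi^2}{p^2}\operatorname{Pot}[\xi]_{\mathrm{Ave}} - \frac{d[q]_{\mathrm{Ave}}}{dx}, \quad \text{etc.} \tag{4}$$
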

It will be convenient to represent these relations by a vector 
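notation. If we represent the displacement by $\mathbf{U}$, and the 
electromotive force by $\mathbf{E}$, the three equations of (3) will be represented by 
the single vector equation 

$$[\mathbf{E}]_{\mathrm{Ave}} = -\operatorname{Pot}[\ddot{\mathbf{U}}]_{\mathrm{Ave}} - \nabla[q]_{\mathrm{Ave}}, \tag{5}$$

and the three equations of (4) by the single vector equation 

$$[\mathbf{E}]_{\mathrm{Ave}} = \frac{4\pi^2}{p^2}\operatorname{Pot}[\mathbf{U}]_{\mathrm{Ave}} - \nabla[q]_{\mathrm{Ave}}, \tag{6}$$

where, in accordance with quaternionic usage, $\nabla[q]_{\mathrm{Ave}}$ represents the 
vector which has for components the derivatives of $[q]_{\mathrm{Ave}}$ with respect 
to rectangular coordinates. The symbol Pot in such a vector equation 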
signifies that the operation which is denoted by this symbol in a 
scalar equation is to be performed upon each of the components of 
the vector. 

7. We may here observe that if we are not satisfied with the law 
adopted for the determination of electrodynamic force we have only 
to substitute for $-\operatorname{Pot}$ in these vector equations, and in those which 
follow, the symbol for the operation, whatever it may be, by which 
we calculate the electrodynamic force from the acceleration.* For 
the operation must be of such a character that if the acceleration 
consist of any number of parts, the force due to the whole acceleration 
will be the resultant of the forces due to the separate parts. It will 
evidently make no difference whether we take an average before or 
after such an operation. 


*The same would not be true of the corresponding scalar equations, (3) and (4). For 
one component of the force might depend upon all the components of acceleration. 
Such is in fact the case with the law of electromotive force proposed by Weber. 
8. Let us now examine the relation which subsists between the 
values of $[\mathbf{E}]_{\mathrm{Ave}}$ and $[\mathbf{U}]_{\mathrm{Ave}}$ for the same point, that is, between the 
average electromotive force and the average displacement in a small 
sphere with its center at the point considered. We have already seen 
that the forces and the displacements are harmonic functions of the 
time having a common period. 

A little consideration will show that if the average electromotive 
force in the sphere is given as a function of the time, the 
displacements in the sphere, both average and actual, must be entirely 
determined. Especially will this be evident, if we consider that since 
we have made the radius of the sphere very small in comparison with 
a wave-length, the average force must have sensibly the same value 
throughout the sphere (that is, if we vary the position of the center 
of the sphere for which the average is taken by a distance not greater 
than the radius, the value of the average will not be sensibly affected), 
and that the difference of the actual and average force at any point 
is entirely determined by the motions in the immediate vicinity of 
that point. If, then, certain oscillatory motions may be kept up in 
the sphere under the influence of electrostatic and electrodynamic 
forces due to the motion in the whole field, and if we suppose the 
motions in and very near that sphere to be unchanged, but the motions 
in the remoter parts of the field to be altered, only not so as to affect 
the average resultant of electromotive force in the sphere, the actual 
resultant of electromotive force will also be unchanged throughout 
the sphere, and therefore the motions in the sphere will still be such 
as correspond to the forces. 

Now the average displacement is a harmonic function of the time 
having a period which we suppose given. It is therefore entirely 
determined for the whole time the vibrations continue by the values 
of the six quantities 

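$$[\xi]_{\mathrm{Ave}},\ [\eta]_{\mathrm{Ave}},\ [\zeta]_{\mathrm{Ave}},\ [\dot{\xi}]_{\mathrm{Ave}},\ [\dot{\eta}]_{\mathrm{Ave}},\ [\dot{\zeta}]_{\mathrm{Ave}}$$

at any one instant. For the same reason the average electromotive 
force is entirely determined for the whole time by the values of the 
six quantities 

$$[X]_{\mathrm{Ave}},\ [Y]_{\mathrm{Ave}},\ [Z]_{\mathrm{Ave}},\ [\dot{X}]_{\mathrm{Ave}},\ [\dot{Y}]_{\mathrm{Ave}},\ [\dot{Z}]_{\mathrm{Ave}}$$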

for the same instant. The first six quantities will therefore be 
functions of the second, and the principle of the superposition of 
motions requires that they shall be homogeneous functions of the 
first degree. And the second six quantities will be homogeneous 
functions of the first degree of the first six. The coefficients by which 
these functions are expressed will depend upon the nature of the 
medium in the vicinity of the point considered. They will also 
depend upon the period of vibration, that is, upon the color of the 
light.* 

We may therefore write in vector notation 

$$[\mathbf{E}]_{\mathrm{Ave}} = \Phi[\mathbf{U}]_{\mathrm{Ave}} + \Psi[\dot{\mathbf{U}}]_{\mathrm{Ave}}, \tag{7}$$

where $\Phi$ and $\Psi$ denote linear functions.† 

The optical properties of media are determined by the form of 
these functions. But all forms of linear functions would not be 
consistent with the principle of the conservation of energy. 

In media which are more or less opaque, and which therefore 
absorb energy, $\Psi$ must be of such a form that the function always 
makes an acute angle (or none) with the independent variable. In 
perfectly transparent media, $\Psi$ must vanish, unless the function is 
at right angles to the independent variable. So far as is known, the 
last occurs only when the medium is subject to magnetic influence. 
In perfectly transparent media, the principle of the conservation of 
energy requires that $\Phi$ should be self-conjugate, i.e., that for three 
directions at right angles to one another, the function and independent 
variable should coincide in direction. 
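
These energy conditions can be illustrated by integrating the work done by the electromotive force over one period. The following is a sketch only, assuming that the linear functions of (7) are represented by 3×3 matrices and that the work per unit volume is the time-integral of the scalar product of force and rate of displacement; the matrices, amplitudes, and names are hypothetical illustrations.

```python
import math

def matvec(M, v):
    """Apply a 3x3 matrix (a linear vector function) to a vector."""
    return [sum(M[i][j] * v[j] for j in range(3)) for i in range(3)]

def dot(a, b):
    return sum(x * y for x, y in zip(a, b))

def work_per_period(Phi, Psi, A1, A2, p, steps=2000):
    """Integrate E . dU/dt over one period, with E = Phi U + Psi dU/dt as in (7)."""
    w = 2.0 * math.pi / p
    dt = p / steps
    total = 0.0
    for k in range(steps):
        t = k * dt
        c, s = math.cos(w * t), math.sin(w * t)
        U = [a1 * c + a2 * s for a1, a2 in zip(A1, A2)]
        Ud = [w * (-a1 * s + a2 * c) for a1, a2 in zip(A1, A2)]
        E = [x + y for x, y in zip(matvec(Phi, U), matvec(Psi, Ud))]
        total += dot(E, Ud) * dt
    return total

A1, A2, p = [1.0, 0.0, 0.5], [0.0, 0.8, -0.2], 2.0        # illustrative amplitudes
ZERO = [[0.0, 0.0, 0.0] for _ in range(3)]
PHI_SYM = [[2.0, 0.3, 0.0], [0.3, 1.5, 0.1], [0.0, 0.1, 1.0]]   # self-conjugate
PSI_DAMP = [[0.5, 0.0, 0.0], [0.0, 0.5, 0.0], [0.0, 0.0, 0.5]]  # acute angle
PSI_ROT = [[0.0, 0.4, 0.0], [-0.4, 0.0, 0.0], [0.0, 0.0, 0.0]]  # right angle

transparent = work_per_period(PHI_SYM, ZERO, A1, A2, p)
absorbing = work_per_period(PHI_SYM, PSI_DAMP, A1, A2, p)
rotatory = work_per_period(PHI_SYM, PSI_ROT, A1, A2, p)
```

In this sketch the self-conjugate function alone absorbs no net energy over a period, the second function absorbs energy when it makes an acute angle with its variable, and it absorbs none when it is at right angles to its variable.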

In all isotropic media not subject to magnetic influence, it is 
probable that $\Phi$ and $\Psi$ reduce to numerical coefficients, as is certainly 
the case with $\Phi$ for transparent isotropic media. 

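9. Comparing the two values of $[\mathbf{E}]_{\mathrm{Ave}}$, we have 

$$\frac{4\pi^2}{p^2}\operatorname{Pot}[\mathbf{U}]_{\mathrm{Ave}} - \nabla[q]_{\mathrm{Ave}} = \Phi[\mathbf{U}]_{\mathrm{Ave}} + \Psi[\dot{\mathbf{U}}]_{\mathrm{Ave}}. \tag{8}$$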

This equation, in connection with that by which we express the 
solenoidal character of the displacements, if we regard them as necessarily 
solenoidal, or in connection with that which expresses the relation 
between the electrostatic potential and the displacements, if we reject 
the solenoidal hypothesis, may be regarded as the general equation of 
the vibrations of monochromatic light, considered as oscillating 
electrical fluxes. For the symbol Pot, however, we must substitute the 
symbol representing the operation by which electromotive force is 
calculated from acceleration of flux, with the negative sign, if we are 
not satisfied with the law provisionally adopted. 

* The relation between the displacements in one of the small spaces considered and 
the average electromotive force is mathematically analogous to the relation between the 
displacements in a system of a high degree of complexity and certain forces exerted 
from without, which are harmonic functions of the time and under the influence of 
which the system vibrates. The ratio of the displacements to the forces will in general 
vary with the period, and may vary very rapidly. 

An example in which these functions vary very rapidly with the period is afforded 
by the phenomena of selective absorption and abnormal dispersion. 

† A vector is said to be a linear function of another, when the three components of the 
first are homogeneous functions of the first degree of the three components of the second. 
It is important to observe that the existence of molecular vibrations 
of ponderable matter, due to the passage of light through the medium, 
will not affect the reasoning by which this equation has been 
established, provided that the nature and intensity of these vibrations in 
any small part of the medium (as measured by a wave-length) are 
entirely determined by the electrical forces and motions in that part 
of the medium. But the equation would not hold in case of molecular 
vibrations due to magnetic force. Such vibrations would constitute 
an oscillating magnetization of the medium, which has already been 
excluded from the discussion. 

The supposition which has sometimes been made,* that electricity 
possesses a certain mass or inertia, would not at all affect the validity 
of the equation. 

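10. The equation may be reduced to a form in some respects more 
simple by the use of the so-called imaginary quantities. We shall 
write $\iota$ for $\sqrt{-1}$. If we differentiate with respect to the time, and 
substitute $-\frac{4\pi^2}{p^2}[\mathbf{U}]_{\mathrm{Ave}}$ for $[\ddot{\mathbf{U}}]_{\mathrm{Ave}}$, we obtain 

$$\frac{4\pi^2}{p^2}\operatorname{Pot}[\dot{\mathbf{U}}]_{\mathrm{Ave}} - \nabla[\dot{q}]_{\mathrm{Ave}} = \Phi[\dot{\mathbf{U}}]_{\mathrm{Ave}} - \frac{4\pi^2}{p^2}\Psi[\mathbf{U}]_{\mathrm{Ave}}.$$

If we multiply this equation by $\iota$, either alone or in connection with 
any real factor, and add it to the preceding, we shall obtain an 
equation which will be equivalent to the two of which it is formed. 

Multiplying by $-\iota\frac{p}{2\pi}$ and adding, we have 

$$\frac{4\pi^2}{p^2}\operatorname{Pot}\Big([\mathbf{U}]_{\mathrm{Ave}} - \iota\frac{p}{2\pi}[\dot{\mathbf{U}}]_{\mathrm{Ave}}\Big) - \nabla\Big([q]_{\mathrm{Ave}} - \iota\frac{p}{2\pi}[\dot{q}]_{\mathrm{Ave}}\Big) = \Big(\Phi + \iota\frac{2\pi}{p}\Psi\Big)\Big([\mathbf{U}]_{\mathrm{Ave}} - \iota\frac{p}{2\pi}[\dot{\mathbf{U}}]_{\mathrm{Ave}}\Big).$$

If we set 

$$\mathbf{W} = [\mathbf{U}]_{\mathrm{Ave}} - \iota\frac{p}{2\pi}[\dot{\mathbf{U}}]_{\mathrm{Ave}}, \tag{9}$$

$$Q = [q]_{\mathrm{Ave}} - \iota\frac{p}{2\pi}[\dot{q}]_{\mathrm{Ave}}, \tag{10}$$

$$\Theta = \Phi + \iota\frac{2\pi}{p}\Psi, \tag{11}$$

our equation reduces to 

$$\frac{4\pi^2}{p^2}\operatorname{Pot}\mathbf{W} - \nabla Q = \Theta\mathbf{W}. \tag{12}$$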

In this equation $\Theta$ denotes a complex linear vector function, i.e., a 
vector function of which the X-, Y-, and Z-components are expressed 
in terms of the X-, Y-, and Z-components of the independent variable 
by means of coefficients of the form $a + \iota b$. $\mathbf{W}$ is a bivector of which 

* See Weber, Abhandl. d. K. Sächs. Gesellsch. d. Wiss., vol. vi, pp. 593-597; Lorberg, 
Crelle's Journal, vol. lxxi, p. 56. 
the real part represents the averaged displacement $[\mathbf{U}]_{\mathrm{Ave}}$ and the 
coefficient of $\iota$ the rate of increase of the same multiplied by a 
constant factor. This bivector therefore represents the average 
state of a small part of the field both with respect to position and 
velocity. We may also say that the coefficient of $\iota$ in $\mathbf{W}$ represents 
the value of the averaged displacement $[\mathbf{U}]_{\mathrm{Ave}}$ at a time one-quarter 
of a vibration earlier than the time principally considered. 

11. It may serve to fix our ideas to see how W is expressed as a 
function of the time. We may evidently set 

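$$[\mathbf{U}]_{\mathrm{Ave}} = \mathbf{A}_1 \cos\frac{2\pi}{p}t + \mathbf{A}_2 \sin\frac{2\pi}{p}t,$$

where $\mathbf{A}_1$ and $\mathbf{A}_2$ are vectors representing the amplitudes of the two 
parts into which the vibration is resolved. Then 

$$\frac{p}{2\pi}[\dot{\mathbf{U}}]_{\mathrm{Ave}} = -\mathbf{A}_1 \sin\frac{2\pi}{p}t + \mathbf{A}_2 \cos\frac{2\pi}{p}t,$$

and 

$$[\mathbf{U}]_{\mathrm{Ave}} - \iota\frac{p}{2\pi}[\dot{\mathbf{U}}]_{\mathrm{Ave}} = (\mathbf{A}_1 - \iota\mathbf{A}_2)\Big(\cos\frac{2\pi}{p}t + \iota\sin\frac{2\pi}{p}t\Big);$$

that is, if we set $\mathbf{A} = \mathbf{A}_1 - \iota\mathbf{A}_2$, 

$$\mathbf{W} = \mathbf{A}\,e^{\iota\frac{2\pi}{p}t}. \tag{13}$$

In like manner we may obtain 

$$Q = g\,e^{\iota\frac{2\pi}{p}t}, \tag{14}$$

where $g$ is a biscalar, or complex quantity of ordinary algebra. 
Substituting these values in (12), and cancelling the common factor 
containing the time, we have 

$$\frac{4\pi^2}{p^2}\operatorname{Pot}\mathbf{A} - \nabla g = \Theta\mathbf{A}. \tag{15}$$

Our equation is thus reduced to one between $\mathbf{A}$ and $g$, and may easily 
be reduced to one in $\mathbf{A}$ alone.* Now $\mathbf{A}$ represents six numerical 
quantities (viz., the three components of $\mathbf{A}_1$, and the three of $\mathbf{A}_2$), 
which may be called the six components of amplitude. The equation, 
therefore, substantially represents the relations between the six 
components of amplitude in different parts of the field.† The equation 
is, however, not really different from (12), since $\mathbf{A}$ and $g$ are only 
particular values of $\mathbf{W}$ and $Q$. 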
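
The relation between the definition (9) of the bivector and its exponential form (13) can be verified numerically. The sketch below assumes arbitrary illustrative amplitudes and represents each component of the bivector by a Python complex number; the function names are hypothetical.

```python
import cmath
import math

def displacement(t, p, A1, A2):
    """Averaged displacement A1 cos(2 pi t / p) + A2 sin(2 pi t / p)."""
    w = 2.0 * math.pi / p
    return [a1 * math.cos(w * t) + a2 * math.sin(w * t) for a1, a2 in zip(A1, A2)]

def velocity(t, p, A1, A2):
    """Time-derivative of the averaged displacement."""
    w = 2.0 * math.pi / p
    return [w * (-a1 * math.sin(w * t) + a2 * math.cos(w * t)) for a1, a2 in zip(A1, A2)]

def bivector_from_definition(t, p, A1, A2):
    """W = [U] - i (p / 2 pi) [dU/dt], as in equation (9)."""
    scale = p / (2.0 * math.pi)
    U, Ud = displacement(t, p, A1, A2), velocity(t, p, A1, A2)
    return [complex(u, -scale * v) for u, v in zip(U, Ud)]

def bivector_from_amplitude(t, p, A1, A2):
    """W = A exp(i 2 pi t / p) with A = A1 - i A2, as in equation (13)."""
    phase = cmath.exp(1j * 2.0 * math.pi * t / p)
    return [complex(a1, -a2) * phase for a1, a2 in zip(A1, A2)]

p, t = 2.0, 0.37                                   # illustrative values only
A1, A2 = [1.0, 0.0, 0.5], [0.0, 0.8, -0.2]

W_def = bivector_from_definition(t, p, A1, A2)
W_amp = bivector_from_amplitude(t, p, A1, A2)
U_quarter_earlier = displacement(t - p / 4.0, p, A1, A2)
```

The two forms agree component by component, and the imaginary part of each component equals the displacement one-quarter of a period earlier, as stated in § 10.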

* The terms $\nabla Q$, $\nabla g$ are allowed to remain in these equations, because the best 
manner of eliminating them will depend somewhat upon our admission or rejection of 
the solenoidal hypothesis. 

† The representation of the six components of amplitude by a single letter should not 
be regarded as an analytical artifice. It only leaves undivided in our notation that 
which is undivided in the nature of things. The separation of the six components of 
amplitude is artificial, in that it introduces arbitrary elements into the discussion, viz. 
the directions of the axes of the coordinates, and the zero of time. 






12. From the general equation given above (8, 12, or 15), in 
connection with the solenoidal hypothesis, we may easily derive the laws 
of the propagation of plane waves in the interior of a sensibly 
homogeneous medium, and the laws of reflection and refraction at surfaces 
between such media. This has been done by Maxwell,* Lorentz,† and 
others,‡ with fundamental equations more or less similar. 

The method, however, by which the fundamental equation has 
been established in this paper seems free from certain objections 
which have been brought against the ordinary form of the theory. 
As ordinarily treated, the phenomena are made to depend entirely 
on the inductive capacity and the conductivity of the medium, in a 
manner which may be expressed by the equation 

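$$[\dot{\mathbf{U}}]_{\mathrm{Ave}} = \frac{K}{4\pi}[\dot{\mathbf{E}}]_{\mathrm{Ave}} + C[\mathbf{E}]_{\mathrm{Ave}}, \tag{16}$$

where $K$ and $C$ denote in the most general case the linear vector 
functions, but in isotropic bodies the numerical coefficients, which 
represent inductive capacity and conductivity. By a simple 
transformation {see (9) and (10)}, this equation becomes 

$$\mathbf{W} = \Big(\frac{K}{4\pi} - \iota\frac{pC}{2\pi}\Big)\Big(\frac{4\pi^2}{p^2}\operatorname{Pot}\mathbf{W} - \nabla Q\Big), \tag{17}$$

which will be equivalent to (12), if 

$$\Theta^{-1} = \frac{K}{4\pi} - \iota\frac{pC}{2\pi}, \tag{18}$$

where $\Theta^{-1}$ represents the function inverse to $\Theta$. 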
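
The transformation from the ordinary relation between current and force to the complex form can be checked for an isotropic body, where the inductive capacity and conductivity are plain numbers. The following is a sketch under stated assumptions: the rate of displacement is taken as (K/4π) times the rate of change of the force plus C times the force, one rectangular component is considered, the displacement has no constant term, and all names and numerical values are hypothetical.

```python
import math

def ordinary_theory_check(K, C, p, E1, E2, t):
    """Return (W, theta_inverse * F) for one component at time t.

    E = E1 cos(wt) + E2 sin(wt) is the averaged electromotive force.
    F = E - i (p / 2 pi) dE/dt is the complex force, which by (6) and its
    time-derivative equals (4 pi^2 / p^2) Pot W - grad Q.
    """
    w = 2.0 * math.pi / p
    c, s = math.cos(w * t), math.sin(w * t)
    E = E1 * c + E2 * s
    E_dot = w * (-E1 * s + E2 * c)
    E_int = (E1 * s - E2 * c) / w                   # time-integral of E, no constant term
    U = K / (4.0 * math.pi) * E + C * E_int         # displacement
    U_dot = K / (4.0 * math.pi) * E_dot + C * E     # assumed ordinary-theory relation
    W = complex(U, -U_dot / w)                      # definition (9), one component
    F = complex(E, -E_dot / w)
    theta_inverse = complex(K / (4.0 * math.pi), -p * C / (2.0 * math.pi))
    return W, theta_inverse * F

W, rhs = ordinary_theory_check(K=2.5, C=0.7, p=1.3, E1=0.4, E2=-0.9, t=0.21)
```

Both returned values coincide, which is the content of the complex relation for a single component. With C set to zero the factor reduces to K/4π, whatever the period.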

Now, while experiment appears to verify the existence of such a 
law as is expressed by equation (12), it does not show that $\Theta$ has the 
precise form indicated by equation (18). In other words, experiment 
does not satisfactorily verify the relations expressed by (16) and (17), 
if $K$ and $C$ are understood to be the operators (or, in isotropic bodies, 
the numbers) which represent inductive capacity and conductivity in 
the ordinary sense of the terms. 

The discrepancy is most easily shown in the most simple case, when 
the medium is isotropic and perfectly transparent, and $\Theta$ reduces to a 
numerical quantity. The square of the velocity of plane waves is 
then equal to $\Theta/4\pi$, and equation (18) would make it independent of the 


* Phil. Trans., vol. clv (1865), p. 459, or Treatise on Electricity and Magnetism, 
chap. xx. 

† Schlömilch's Zeitschrift, vol. xxii, pp. 1-30 and 205-219; xxiii, pp. 197-210. 

‡ See Fitzgerald, Phil. Trans., vol. clxxi, p. 691; J. J. Thomson, Phil. Mag. 
(5), vol. ix, p. 284; Rayleigh, Phil. Mag. (5), vol. xii, p. 81. 

That the electromagnetic theory of light gives the conditions relative to the boundary 
of different media, which are required by the phenomena of reflection and refraction, 
was first shown by Helmholtz. See Crelle's Journal, vol. lxxii (1870), p. 57. 
period; that is, would give no dispersion of colors. The case is 
essentially the same in transparent bodies which are not isotropic.* 
The case is worse with metals, which are characterized electrically 
by great conductivity, and optically by great opacity. In their 
papers cited above, Lorentz and Rayleigh have observed that the 
experiments of Jamin on the reflection of light from metallic surfaces 
would often require, as ordinarily interpreted on the electromagnetic 
theory, a negative value for the inductive capacity of the metal. 
This would imply that the electrical equilibrium in the metal is 
unstable. The objection, therefore, is essentially the same as that 
which Lord Rayleigh had previously made to Cauchy’s theory of 
metallic reflection, viz., that the apparent mechanical explanation 
of the phenomena is illusory, since the numerical values given by 
experiment as interpreted on Cauchy's theory would involve an 
unstable equilibrium of the ether in the metal.† 

13. All this points to the same conclusion, that the ordinary view 
of the phenomena is inadequate. The object of this paper will be 
accomplished, if it has been made clear how a point of view more in 
accordance with what we know of the molecular constitution of 
bodies will give that part of the ordinary theory which is verified by 
experiment, without including that part which is in opposition to 
observed facts.‡ 

* See note to the first paper of Lorentz, cited above, Schlömilch, vol. xxii, p. 23. 

† See Phil. Mag. (4), vol. xliii (1872), p. 321. 

‡ The consideration of the processes which we may suppose to take place in the 
smallest parts of a body through which light is transmitted, farther than is necessary to 
establish the general equation given above, is foreign to the design of this paper. Yet a 
word may be added with respect to the difficulties signalized in the ordinary form of the 
theory. The comparatively simple case of a perfectly transparent body has been 
examined more in detail in one of the papers already cited, where there is given an 
explanation of the dispersion of colors from the point of view of this paper. It is there 
shown that the effect of the non-homogeneity of the body in its smallest parts is to add 
a term to the expression for the kinetic energy of electrical waves, which for an isotropic 
body may be roughly described as similar to that which would be required if the 
electricity had a certain mass or inertia. (See especially §§ 7, 9 and 12, [this volume 
pages 185 ff.]) The same must be true of media of any degree of opacity. Now the 
difficulty with the optical properties of the metals is that the real part of $\Theta$ (or $\Theta^{-1}$) 
is in some cases negative. This implies that at a moment of greatest displacement 
the electromotive force is in the direction opposite to the displacement, instead of 
having the same direction, as in transparent isotropic bodies. Now a certain part of 
the electromotive force must be required to oppose the apparent inertia, and another 
part to oppose the electrical elasticity of the medium. These parts of the force must 
have opposite directions. In transparent bodies the latter part is by far the greater. But 
it need not surprise us that the former should be the greater in some metals. 

It has been remarked by Lorentz that the difficulty with respect to metals would be 
in a measure relieved if we should suppose electricity to have the property of inertia. 
(See § 11 of his third paper, Schlömilch’s Zeitschrift, vol. xxiii, p. 208.) But a 
supposition of this kind, taken literally, would involve a dispersion of colors in vacuo, and still 
be inadequate, as Lorentz remarks, to explain the phenomena observed in metals. 


While the writer has aimed at a greater degree of rigor than is 
usual in the establishment of the fundamental equation of 
monochromatic light, it is not claimed that this equation is absolutely 
exact. The contrary is evident from the fact that the equation 
does not embrace the phenomena which characterize such circularly 
polarizing bodies as quartz. This, however, only implies the neglect 
of extremely small quantities—very small, for example, as compared 
with those which determine the dispersion of colors. In one of the 
papers already cited,* the case of a perfectly transparent body is 
treated with a higher degree of approximation, so as to embrace the 
phenomena in question. 

* See page 195 of this volume. 



XIV. 


A COMPARISON OF THE ELASTIC AND THE ELECTRICAL 
THEORIES OF LIGHT WITH RESPECT TO THE LAW OF 
DOUBLE REFRACTION AND THE DISPERSION OF COLORS. 

[American Journal of Science, ser. 3, vol. xxxv, pp. 467-475, June, 1888.] 

It is claimed for the electrical * theory of light that it is free from 
serious difficulties, which beset the explanation of the phenomena of 
light by the dynamics of elastic solids. Just what these difficulties 
are, and why they do not occur in the explanation of the same 
phenomena by the dynamics of electricity, has not perhaps been 
shown with all the simplicity and generality which might be desired. 
Such a treatment of the subject is however the more necessary on 
account of the ever-increasing bulk of the literature on either side, 
and the confusing multiplicity of the elastic theories. It is the object 
of this paper to supply this want, so far as respects the propagation 
of plane waves in transparent and sensibly homogeneous media. The 
simplicity of this part of the subject renders it appropriate for the 
first test of any optical theory, while the precision of which the 
experimental determinations are capable, renders the test extremely 
rigorous. 

It is moreover, as the writer believes, an appropriate time for the 
discussion proposed, since on one hand the experimental verification 
of Fresnel’s Law has recently been carried to a degree of precision far 
exceeding anything which we have had before,† and on the other, the 

* The term electrical seems the most simple and appropriate to describe that theory of 
light which makes it consist in electrical motions. The cases in which any distinctively 
magnetic action is involved in the phenomena of light are so exceptional, that it is 
difficult to see any sufficient reason why the general theory should be called 
electromagnetic, unless we are to call all phenomena electromagnetic which depend on the 
motions of electricity. 

† In the recent experiments of Professor Hastings relating to the index of refraction of 
the extraordinary ray in Iceland spar for the spectral line D₂ and a wave-normal inclined 
at about 31° to the optic axis, the difference between the observed and the calculated 
values was only two or three units in the sixth decimal place (in the seventh significant 
figure), which was about the probable error of the determinations. See Am. Jour. Sci. 
ser. 3, vol. xxxv, p. 60. 
discovery of a remarkable theorem relating to the vibrations of a 
strained solid * has given a new impulse to the study of the elastic 
theory of light. 

Let us first consider the facts to which a correct theory must 
conform. 

It is generally admitted that the phenomena of light consist 
in motions (of the type which we call wave-motions) of 
something which exists both in space void of ponderable matter, and 
in the spaces between the molecules of bodies, perhaps also in 
the molecules themselves. The kinematics of these motions is 
pretty well understood; the question at issue is whether it agrees 
with the dynamics of elastic solids or with the dynamics of 
electricity. 

In the case of a simple harmonic wave-motion, which alone we need 
consider, the wave-velocity ($V$) is the quotient of the wave-length ($l$) 
by the period of vibration ($p$). These quantities can be determined 
with extreme accuracy. In media which are sensibly homogeneous 
but not isotropic the wave-velocity V, for any constant value of the 
period, is a quadratic function of the direction cosines of a certain 
line, viz., the normal to the so-called “ plane of polarization.” The 
physical characteristics of this line have been a matter of dispute. 
Fresnel considered it to be the direction of displacement. Others 
have maintained that it is the common perpendicular to the wave- 
normal and the displacement. Others again would define it as 
that component of the displacement which is perpendicular to the 
wave-normal. This of course would differ from Fresnel’s view only 
in case the displacements are not perpendicular to the wave-normal, 
and would in that case be a necessary modification of his view. 
Although this dispute has been one of the most celebrated in 
physics, it seems to be at length substantially settled, most directly 
by experiments upon the scattering of light by small particles, 
which seems to show decisively that in isotropic media at least 
the displacements are normal to the “plane of polarization,” and 
also, with hardly less cogency, by the difficulty of accounting 
for the intensities of reflected and refracted light on any other 


* Sir Wm. Thomson has shown that if an elastic incompressible solid in which the 
potential energy of any homogeneous strain is proportional to the sum of the squares of 
the reciprocals of the principal elongations minus three is subjected to any homogeneous 
strain by forces applied to its surface, the transmission of plane waves of distortion, 
superposed on this homogeneous strain, will follow exactly Fresnel’s law (including the 
direction of displacement), the three principal velocities being proportional to the 
reciprocals of the principal elongations. It must be a surprise to mathematicians and 
physicists to learn that a theorem of such simplicity and beauty has been waiting to be 
discovered in a field which has been so carefully gleaned. See page 116 of the current 
volume (xxv) of the Philosophical Magazine. 
supposition.* It should be added that all diversity of opinion on 
this subject has been confined to those whose theories are based on 
the dynamics of elastic bodies. Defenders of the electrical theory 
have always placed the electrical displacement at right angles to the 
“plane of polarization.” It will, however, be better to assume this 
direction of the displacement as probable rather than as absolutely 
certain, not so much because many are likely to entertain serious 
doubts on the subject, as in order not to exclude views which have 
at least a historical interest. 

The wave-velocity, then, for any constant period, is a quadratic 
function of the cosines of a certain direction, which is probably that 
of the displacement, but in any case determined by the displacement 
and the wave-normal. The coefficients of this quadratic function are 
functions of the period of vibration. It is important to notice that 
these coefficients vary separately, and often quite differently, with the 
period, and that the case does not at all resemble that of a quadratic 
function of the direction-cosines multiplied by a quantity depending 
on the period. 

In discussing the dynamics of the subject we may gain something 
in simplicity by considering a system of stationary waves, such as 
results from two similar systems of progressive waves moving in 
opposite directions. In such a system the energy is alternately 
entirely kinetic and entirely potential. Since the total energy is 
constant, we may set the average kinetic energy per unit of volume 
at the moment when there is no potential energy, equal to the average 
potential energy per unit of volume when there is no kinetic energy.† 
We may call this the equation of energies. It will contain the 
quantities l and p, and thus furnish an expression for the velocity 
of either system of progressive waves. We have to see whether the 
elastic or the electric theory gives the expression most conformed to 
the facts. 

Let us first apply the elastic theory to the case of the so-called 

* “At the same time, if the above reasoning be valid, the question as to the direction 
of the vibrations in polarized light is decided in accordance with the view of Fresnel. 
... I confess I cannot see any room for doubt as to the result it leads to. . . . I only 
mean that if light, as is generally supposed, consists of transversal vibrations similar to 
those which take place in an elastic solid, the vibration must be normal to the plane of 
polarization.” Lord Rayleigh, “On the Light from the Sky, its Polarization and Color;” 
Phil. Mag. (4), xli (1871), p. 109. 

“Green’s dynamics of polarization by reflexion, and Stokes’ dynamics of the diffraction 
of polarized light, and Stokes’ and Rayleigh’s dynamics of the blue sky, all agree in, as 
it seems to me, irrefragably, demonstrating Fresnel’s original conclusion, that in plane 
polarized light the line of vibration is perpendicular to the plane of polarization.” 
Sir Wm. Thomson, loc. citat. 

† The terms kinetic energy and potential energy will be used in this paper to denote 
these average values. 
vacuum. If we write $h$ for the amplitude measured in the middle 
between two nodal planes, the velocities of displacement will be as 
$h/p$, and the kinetic energy will be represented by $A\,h^2/p^2$, where $A$ is a 
constant depending on the density of the medium. The potential 
energy, which consists in distortion of the medium, may be represented 
by $B\,h^2/l^2$, where $B$ is a constant depending on the rigidity of the 
medium. The equation of energies, on the elastic theory, is therefore 

$$A\frac{h^2}{p^2} = B\frac{h^2}{l^2}, \tag{1}$$

which gives 

$$V^2 = \frac{l^2}{p^2} = \frac{B}{A}. \tag{2}$$


In the electrical theory, the kinetic energy is not determined by the 
simple formula of ordinary dynamics from the square of the velocity 
of each element, but is found by integrating the product of the 
velocities of each pair of elements divided by the distance between 
them. Very elementary considerations suffice to show that a quantity 
thus determined when estimated per unit of volume will vary as the 
square of the wave-length. We may therefore set $F\,l^2h^2/p^2$ for the kinetic 
energy, $F$ being a constant. The potential energy does not consist 
in distortion of the medium, but depends upon an elastic resistance 
to the separation of the electricities, which constitutes the electrical 
displacement, and is proportioned to the square of this displacement. 
The average value of the potential energy per unit of volume will 
therefore be represented in the electrical theory by $G\,h^2$, where $G$ is a 
constant, and the equation of energies will be 

$$F\frac{l^2h^2}{p^2} = G\,h^2, \tag{3}$$

which gives 

$$V^2 = \frac{l^2}{p^2} = \frac{G}{F}. \tag{4}$$

Both theories give a constant velocity, as is required. But it is 
instructive to notice the profound difference in the equations of energy 
from which this result is derived. In the elastic theory the square of 
the wave-length appears in the potential energy as a divisor; in the 
electrical theory it appears in the kinetic energy as a factor. 
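
The constants of the elastic theory may be illustrated by a minimal sketch, which is not part of the original argument. It assumes a stationary wave of transverse displacement $\xi$ along the axis of $x$ in an ordinary elastic solid, and the symbols $\rho$ (density) and $\mu$ (rigidity) are introduced here only for the illustration. Bars denote averages over a wave-length.

$$\xi = h\,\sin\frac{2\pi x}{l}\,\cos\frac{2\pi t}{p},$$

$$\overline{T} = \tfrac{1}{2}\rho\,\overline{\left(\frac{d\xi}{dt}\right)^2} = \pi^2\rho\,\frac{h^2}{p^2} \quad \text{(at the instant of greatest velocity)},$$

$$\overline{W} = \tfrac{1}{2}\mu\,\overline{\left(\frac{d\xi}{dx}\right)^2} = \pi^2\mu\,\frac{h^2}{l^2} \quad \text{(at the instant of greatest displacement)}.$$

On these assumptions the constants of the elastic theory would be $A = \pi^2\rho$ and $B = \pi^2\mu$, and equating the two averages gives $V^2 = l^2/p^2 = \mu/\rho$, the familiar velocity of waves of distortion. The wave-length enters only through the potential energy, as a divisor, in agreement with the remark above.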

Let us now consider how these equations will be modified by the 
presence of ponderable matter, in the most general case of transparent 
and sensibly homogeneous bodies. This subject is rendered much 
more simple by the fact that the distances between the ponderable 
molecules are very small compared with a wave-length. Or, what 
amounts to the same thing, but may present a more distinct picture 
to the imagination, the wave-length may be regarded as enormously 
great in comparison with the distances between neighboring molecules. 
Whatever view we take of the motions which constitute light, we can 
hardly suppose them (disturbed as they are by the presence of the 
ponderable molecules) to be in strictness represented by the equations 
of wave-motion. Yet in a certain sense a wave-motion may and does 
exist. If, namely, instead of the actual displacement at any point, 
we consider the average displacement in a space large enough to 
contain an immense number of molecules, and yet small as measured 
by a wave-length, such average displacements may be represented 
by the equations of wave-motion; and it is only in this sense that 
any theory of wave-motion can apply to the phenomena of light in 
transparent bodies. When we speak of displacements, amplitudes, 
velocities (of displacement), etc., it must therefore be understood in 
this way. 

The actual kinetic energy, on either theory, will evidently be 
greater than that due to the motion thus averaged or smoothed, and 
to a degree presumably depending on the direction of the displacement. 
But since displacement in any direction may be regarded as 
compounded of displacements in three fixed directions, the additional 
energy will be a quadratic function of the components of velocity of 
displacement, or, in other words, a quadratic function of the direction- 
cosines of the displacement multiplied by the square of the amplitude 
and divided by the square of the period.* This additional energy 
may be understood as including any part of the kinetic energy of 
the wave-motion which may belong to the ponderable particles. The 
term to be added to the kinetic energy on the electric theory may 
therefore be written $f_D\,h^2/p^2$, where $f_D$ is a quadratic function of the 
direction-cosines of the displacement. The elastic theory requires a 
term of precisely the same character, but since the term to which it 
is to be added is of the same general form, the two may be incorporated 
in a single term of the form $A_D\,h^2/p^2$, where $A_D$ is a quadratic 
function of the direction-cosines of the displacement. We must, 
however, notice that both $A_D$ and $f_D$ are not entirely independent of 
the period. For the manner in which the flux of the luminiferous 
medium is distributed among the ponderable molecules will naturally 
depend somewhat upon the period. The same is true of the degree 
to which the molecules may be thrown into vibration. But $A_D$ and 
$f_D$ will be independent of the wave-length (except so far as this is 

* For proof in extenso of this proposition, when the motions are supposed electrical, 
the reader is referred to page 187 of this volume. 
connected with the period), because the wave-length is enormously 
great compared with the size of the molecules and the distances 
between them. 

The potential energy on the elastic theory must be increased by 
a term of the form $b_D\,h^2$, where $b_D$ is a quadratic function of the 
direction-cosines of the displacement. For the ponderable particles 
must oppose a certain elastic resistance to the displacement of the 
ether, which in aeolotropic bodies will presumably be different in 
different directions. The potential energy on the electric theory will 
be represented by a single term of the same form, say $G_D\,h^2$, where 
a quadratic function of the direction-cosines of the displacement, $G_D$, 
takes the place of the constant $G$, which was sufficient when the 
ponderable particles were absent. Both $G_D$ and $b_D$ will vary to some 
extent with the period, like $A_D$ and $f_D$, and for the same reason. 

In regard to that potential energy, which on the elastic theory is 
independent of the direct action of the ponderable molecules, it has 
been supposed that in aeolotropic bodies the effect of the molecules is 
such as to produce an aeolotropic state in the ether, so that the energy 
of a distortion varies with its orientation. This part of the potential 
energy will then be represented by $B_{ND}\,h^2/l^2$, where $B_{ND}$ is a function of 
the directions of the wave-normal and the displacement. It may 
easily be shown that it is a quadratic function both of the direction- 
cosines of the wave-normal and of those of the displacement. Also, 
that if the ether in the body when undisturbed is not in a state of 
stress due to forces at the surface of the body, or if its stress is 
uniform in all directions, like a hydrostatic pressure, the function $B_{ND}$ 
must be symmetrical with respect to the two sets of direction-cosines. 

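The equation of energies for the elastic theory is therefore 

$$A_D\frac{h^2}{p^2} = B_{ND}\frac{h^2}{l^2} + b_D\,h^2, \tag{5}$$

which gives 

$$V^2 = \frac{l^2}{p^2} = \frac{B_{ND}}{A_D - b_D\,p^2}. \tag{6}$$

The equation of energies for the electrical theory is 

$$F\frac{l^2h^2}{p^2} + f_D\frac{h^2}{p^2} = G_D\,h^2, \tag{7}$$

which gives 

$$V^2 = \frac{l^2}{p^2} = \frac{G_D}{F} - \frac{f_D}{F\,p^2}. \tag{8}$$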

It is evident at once that the electrical theory gives exactly the 
form that we want. For any constant period the square of the 
wave-velocity is a quadratic function of the direction-cosines of 
the displacement. When the period varies, this function varies, 
the different coefficients in the function varying separately, because 
$G_D$ and $f_D$ will not in general be similar functions.* If we consider a 
constant direction of displacement while the period varies, $G_D$ and 
$f_D$ will only vary so far as the type of the motion varies, i.e., so far 
as the manner in which the flux distributes itself among the 
ponderable molecules and intermolecular spaces, and the extent to 
which the molecules take part in the motion are changed. There 
are cases in which these vary rapidly with the period, viz., cases 
of selective absorption and abnormal dispersion. But we may fairly 
expect that there will be many cases in which the character of the 
motion in these respects will not vary much with the period. 
$G_D/F$ and $f_D/F$ will then be sensibly constant and we have an approximate 
expression for the general law of dispersion, which agrees remarkably 
well with experiment.† 
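
How the electrical expression for the wave-velocity, $V^2 = G_D/F - f_D/(F p^2)$, leads to a law of dispersion may be sketched as follows. The symbols $n$ (index of refraction), $c$ (velocity of light in vacuo), and $\lambda_0$ (wave-length in vacuo) are introduced here for illustration and are not the notation of the text. Since $V = c/n$ and $\lambda_0 = cp$, division by $c^2$ gives

$$\frac{1}{n^2} = \frac{G_D}{F c^2} - \frac{f_D}{F}\,\frac{1}{\lambda_0^2}.$$

If $G_D/F$ and $f_D/F$ are treated as constants, the reciprocal of the square of the index is a constant diminished by a term inversely proportional to the square of the wave-length in vacuo. When $f_D/F$ is positive the index therefore increases as the wave-length diminishes.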

If we now return to the equation of energies obtained from the 
elastic theory, we see at once that it does not suggest any such 
relation as experiment has indicated, either between the wave-velocity 
and the direction of displacement, or between the wave-velocity and 
the period. It remains to be seen whether it can be brought to 
agree with experiment by any hypothesis not too violent. 

In order that $V^2$ may be a quadratic function of any set of 
direction-cosines, it is necessary that $A_D$ and $b_D$ shall be independent 
of the direction of the displacement, in other words, in the case of 
a crystal like Iceland spar, that the direct action of the ponderable 
molecules upon the ether shall affect both the kinetic and the 
potential energy in the same way, whether the displacement take 
place in the direction of the optic axis or at right angles to it. 
This is contrary to everything which we should expect. If, nevertheless, 
we make this supposition, it remains to consider $B_{ND}$. This 
must be a quadratic function of a certain direction, which is almost 
certainly that of the displacement. If the medium is free from 
external stress (other than hydrostatic), $B_{ND}$, as we have seen, is 
symmetrical with respect to the wave-normal and the direction 
of displacement, and a quadratic function of the direction-cosines 
of each. The only single direction of which it can be a function is 
the common perpendicular to these two directions. If the wave- 
normal and the displacement are perpendicular, the direction-cosines 

* But $G_D$, $f_D$, and $V^2$, considered as functions of the direction of displacement, are all 
subject to any law of symmetry which may belong to the structure of the body 
considered. The resulting optical characteristics of the different crystallographic 
systems are given on pages 192-194. 

† This will appear most distinctly if we consider that $V$ divided by the velocity of 
light in vacuo gives the reciprocal of the index of refraction, and $p$ multiplied by the 
same quantity gives the wave-length in vacuo. 
of the common perpendicular to both will be linear functions of the 
direction-cosines of each, and a quadratic function of the direction- 
cosines of the common perpendicular will be a quadratic function of 
the direction-cosines of each. We may thus reconcile the theory 
with the law of double refraction, in a certain sense, by supposing 
that $A_D$ and $b_D$ are independent of the direction of displacement, 
and that $B_{ND}$ and therefore $V^2$ is a quadratic function of the direction- 
cosines of the common perpendicular to the wave-normal and the 
displacement. But this supposition, besides its intrinsic improbability 
so far as $A_D$ and $b_D$ are concerned, involves a direction of the displacement 
which is certainly or almost certainly wrong. 

We are thus driven to suppose that the undisturbed medium is in 
a state of stress, which, moreover, is not a simple hydraulic stress. 
In this case, by attributing certain definite physical properties to 
the medium, we may make the function $B_{ND}$ become independent of 
the direction of the wave-normal, and reduce to a quadratic function 
of the direction-cosines of the displacement.* This entirely satisfies 
Fresnel’s Law, including the direction of displacement, if we can 
suppose $A_D$ and $b_D$ independent of the direction of displacement. 
But this supposition, in any case difficult for aeolotropic bodies, 
seems quite irreconcilable with that of a permanent (not hydrostatic) 
stress. 

For this stress can only be kept up by the action of the ponderable 
molecules, and by a sort of action which hinders the passage of the 
ether past the molecules. Now the phenomena of reflection and 
refraction would be very different from what they are, if the optical 
homogeneity of a crystal did not extend up very close to the surface. 
This implies that the stress is produced by the ponderable particles 
in a very thin lamina at the surface of the crystal, much less in 
thickness, it would seem probable, than a wave-length of yellow 
light. And this again implies that the power of the ponderable 
particles to pin down the ether, as it were, to a particular position is 
very great, and that the term in the energy relating to the motion 
of the ether relative to the ponderable particles is very important. 
This is the term containing the factor $b_D$, which it is difficult to 
suppose independent of the direction of displacement because the 
dimensions and arrangement of the particles are different in different 
directions. But our present hypothesis has brought in a new 
reason for supposing $b_D$ to depend on the direction of displacement, 
viz., on account of the stress of the medium. A general displacement 
of the medium midway between two nodal planes, when it is 
restrained at innumerable points by the ponderable particles, will 


* See note on page 224. 
produce special distortions due to these particles. The nature of 
these distortions is wholly determined by the direction of displacement, 
and it is hard to conceive of any reason why the energy of 
these distortions should not vary with the direction of displacement, 
like the energy of the general distortion of the wave-motion, which 
is partly determined by the displacement and partly by the wave- 
normal.* 

But the difficulties of the elastic theory do not end with the 
law of double refraction, although they are there more conspicuous 
on account of the definite and simple law by which they can be 
judged. It does not easily appear how the equation of energies 
can be made to give anything like the proper law of the dispersion 
of colors. Since for given directions of the wave-normal and displacement, 
or in an isotropic body, $B_{ND}$ is constant, and also $A_D$ and $b_D$, 
except so far as the type of the vibration varies, the formula requires 
that the square of the index of refraction (which is inversely as $V^2$) 
should be equal to a constant diminished by a term proportional to 
the square of the period, except so far as this law is modified by a 
variation of the type of vibration. But experiment shows nothing 
like this law. Now the variation in the type of vibration is some¬ 
times very important,—it plays the leading role in the phenomena 
of selective absorption and abnormal dispersion,—but this is certainly 
not always the case. It seems hardly possible to suppose that the 
type of vibration is always so variable as entirely to mask the law 
which is indicated by the formula when $A_D$ and $b_D$ (with $B_{ND}$) are 
regarded as constant. This is especially evident when we consider 
that the effect on the wave-velocity of a small variation in the type of 
vibration will be a small quantity of the second order.† 
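
The law referred to may be written out explicitly, as a sketch only. With $n$ for the index of refraction and $c$ for the velocity of light in vacuo (symbols introduced here for illustration, not used in the text), $n^2 = c^2/V^2$, and the elastic expression $V^2 = B_{ND}/(A_D - b_D\,p^2)$ gives

$$n^2 = \frac{c^2 A_D}{B_{ND}} - \frac{c^2\, b_D}{B_{ND}}\,p^2 .$$

For constant $A_D$, $b_D$, and $B_{ND}$ this is a constant diminished by a term proportional to the square of the period, which is the form stated above. It may be contrasted with the electrical theory, in which the period enters $V^2$ through a term in $1/p^2$.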

The phenomena of dispersion, therefore, corroborate the conclusion 
which seemed to follow inevitably from the law of double refraction 
alone. 


* The reader may perhaps ask how the above reasoning is to be reconciled with the 
fact that the law of double refraction has been so often deduced from the elastic theory. 
The troublesome terms are $b_D$ and the variable part of $A_D$, which express the direct 
action of the ponderable molecules on the ether. So far as the (quite limited) reading 
and recollection of the present writer extend, those who have sought to derive the law 
of double refraction from the theory of elastic solids have generally either neglected this 
direct action, a neglect to which Professor Stokes calls attention more than once in his 
celebrated “Report on Double Refraction” (Brit. Assoc., 1862, pp. 264, 268), or taking 
account of this action they have made shipwreck upon a law different from Fresnel’s and 
contradicted by experiment. 

† See pages 190, 191 of this volume, or Lord Rayleigh’s Theory of Sound, vol. i, 
p. 84. 
A COMPARISON OF THE ELECTRIC THEORY OF LIGHT 
AND SIR WILLIAM THOMSON’S THEORY OF A QUASI- 
LABILE ETHER. 

[American Journal of Science, ser. 3, vol. xxxvii, pp. 139-144, February, 1889.] 

A REMARKABLE paper by Sir William Thomson, in the November 
number of the Philosophical Magazine, has opened a new vista in 
the possibilities of the theory of an elastic ether. Since the general 
theory of elasticity gives three waves characterized by different 
directions of displacement for a single wave-plane, while the phenomena 
of optics show but two, the first point in accommodating any 
theory to observation, is to get rid (absolutely or sensibly) of the 
third wave. For this end, it has been common to make the ether 
incompressible, or, as it is sometimes expressed, to make the velocity 
of the third wave infinite. The velocity of the wave of compression 
becomes in fact infinite as the compressibility vanishes. Of course 
it has not escaped the notice of physicists that we may also get 
rid of the third wave by making its velocity zero, as may be done 
by giving certain values to the constants which express the elastic 
properties of the medium, but such values have appeared impossible, 
as involving an unstable state of the medium. The condition of 
incompressibility, absolute or approximate, has therefore appeared 
necessary.* This question of instability has now, however, been 
subjected to a more searching examination, with the result that 
the instability does not really exist provided we either suppose the 
medium to extend all through boundless space, or give it a fixed 
containing vessel as its boundary. This renders possible a very 
simple theory of light, which has been shown to give Fresnel’s laws 
for the intensities of reflected and refracted light and for double 
refraction, so far as concerns the phenomena which can be directly 
observed. The displacement in an aeolotropic medium is in the same 
plane passing through the wave-normal as was supposed by Fresnel, 

* It was under this impression that the paper entitled “ A Comparison of the Elastic 
and the Electric Theories of Light with respect to the Law of Double Refraction and the 
Dispersion of Colors,” [this volume pp. 223-231], was written. The conclusions of 
that paper, except so far as respects the dispersion of colors, will not apply to the 
new theory. 





but its position in that plane is different, being perpendicular to the 
ray instead of to the wave-normal.* 

It is the object of this paper to compare this new theory with 
the electric theory of light. In the limiting cases, that is, when we 
regard the velocity of the missing wave in the elastic theory as zero, 
and in the electric theory as infinite, we shall find a remarkable 
correspondence between the two theories, the motions of monochromatic 
light within isotropic or aeolotropic media of any degree of transparency 
or opacity, and at the boundary between two such media, 
being represented by equations absolutely identical, except that the 
symbols which denote displacement in one theory denote force in 
the other, and vice versa.† In order to exhibit this correspondence 
completely and clearly, it is necessary that the fundamental principles 
of the two theories should be treated with the same generality, and, 
so far as possible, by the same method. The immediate consequences 
of the new theory will therefore be deduced with the same generality 
and essentially by the same method which has been used with 
reference to the electric theory in a former volume of this Journal 
[page 211 of this volume]. 

The elastic properties of the ether, according to the new theory, 
in its limiting case, may be very simply expressed by means of a 
vector operator, for which we shall use Maxwell’s designation. The 
curl of a vector is defined to be another vector so derived from the 
first that if $u$, $v$, $w$ be the rectangular components of the first, and 
$u'$, $v'$, $w'$ those of its curl, 

$$u' = \frac{dw}{dy} - \frac{dv}{dz}, \qquad v' = \frac{du}{dz} - \frac{dw}{dx}, \qquad w' = \frac{dv}{dx} - \frac{du}{dy}, \tag{1}$$

where $x$, $y$, $z$ are rectangular coordinates. With this understanding, 
if the displacement of the ether is represented by the vector $\mathfrak{E}$, the 
force exerted upon any element by the surrounding ether will be 

$$-B\,\operatorname{curl}\,\operatorname{curl}\,\mathfrak{E}\;dx\,dy\,dz, \tag{2}$$

where $B$ is a scalar (the so-called rigidity of the ether) having the 
same constant value throughout all space, whether ponderable matter 
is present or not. 

Where there is no ponderable matter, this force must be equated to 
the reaction of the inertia of the ether. This gives, with omission of 
the common factor dx dy dz, 

$$A\,\ddot{\mathfrak{E}} = -B\,\operatorname{curl}\,\operatorname{curl}\,\mathfrak{E}, \tag{3}$$

where A denotes the density of the ether. 
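
The content of the equation of free ether, $A\,\ddot{\mathfrak{E}} = -B\,\operatorname{curl}\,\operatorname{curl}\,\mathfrak{E}$, may be illustrated by a plane wave. This is a sketch only: the wave-vector $\mathbf{k}$, the quantity $g$ ($2\pi$ divided by the period), and the constant amplitude $\mathfrak{A}$ are symbols assumed for the illustration. For $\mathfrak{E} = \mathfrak{A}\,e^{\iota(gt - \mathbf{k}\cdot\mathbf{r})}$,

$$\operatorname{curl}\,\mathfrak{E} = -\iota\,\mathbf{k}\times\mathfrak{E}, \qquad \operatorname{curl}\,\operatorname{curl}\,\mathfrak{E} = -\mathbf{k}\times(\mathbf{k}\times\mathfrak{E}) = k^2\,\mathfrak{E} - \mathbf{k}\,(\mathbf{k}\cdot\mathfrak{E}).$$

For a transverse wave, $\mathbf{k}\cdot\mathfrak{E} = 0$, and the equation gives

$$A\,g^2 = B\,k^2, \qquad \frac{g^2}{k^2} = \frac{B}{A},$$

so that the two waves of distortion travel with the velocity $\sqrt{B/A}$. For a longitudinal wave, $\mathfrak{E}$ parallel to $\mathbf{k}$, the right-hand side vanishes and $g = 0$: the third wave has zero velocity, which is the limiting case of the new theory described above.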

* Sir William Thomson, loc. citat. R. T. Glazebrook, Phil. Mag., December, 1888. 

† In giving us a new interpretation of the equations of the electric theory, the author 
of the new theory has in fact enriched the mathematical theory of physics with something 
which may be compared to the celebrated principle of duality in geometry. 
The presence of ponderable matter disturbs the motions of the 
ether, and renders them too complicated for us to follow in detail. 
Nor is this necessary, for the quantities which occur in the equations 
of optics represent average values, taken over spaces large enough to 
smooth out the irregularities due to the ponderable particles, although 
very small as measured by a wave-length.* Now the general principles 
of harmonic motion† show that to maintain in any element of 
volume the motion represented by 

$$\mathfrak{E} = \mathfrak{A}\,e^{\iota\frac{2\pi}{p}t}, \tag{4}$$

$\mathfrak{A}$ being a complex vector constant, will require a force from outside 
represented by a complex linear vector function of $\ddot{\mathfrak{E}}$, that is, the 
three components of the force will be complex linear functions of the 
three components of $\ddot{\mathfrak{E}}$. We shall represent this force by 

$$\Psi\ddot{\mathfrak{E}}\;dx\,dy\,dz, \tag{5}$$

where $\Psi$ represents a complex linear vector function.‡ 

If we now equate the force required to maintain the motion in any 
element to that exerted upon the element by the surrounding ether, 
we have the equation 

$$\Psi\ddot{\mathfrak{E}} = -B\,\operatorname{curl}\,\operatorname{curl}\,\mathfrak{E}, \tag{6}$$

which expresses the general law for the motion of monochromatic 
light within any sensibly homogeneous medium, and may be regarded 
as implicitly including the conditions relating to the boundary of two 
such media, which are necessary for determining the intensities of 
reflected and refracted light. 

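For let $u$, $v$, $w$ be the components of $\mathfrak{E}$; $u'$, $v'$, $w'$ those of curl $\mathfrak{E}$; 
and $u''$, $v''$, $w''$ those of curl curl $\mathfrak{E}$, so that 

$$u' = \frac{dw}{dy} - \frac{dv}{dz}, \qquad v' = \frac{du}{dz} - \frac{dw}{dx}, \qquad w' = \frac{dv}{dx} - \frac{du}{dy},$$

$$u'' = \frac{dw'}{dy} - \frac{dv'}{dz}, \qquad v'' = \frac{du'}{dz} - \frac{dw'}{dx}, \qquad w'' = \frac{dv'}{dx} - \frac{du'}{dy},$$

and let the interface be perpendicular to the axis of Z. It is evident 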


* This is in no respect different from what is always tacitly understood in the theory 
of sound, where the displacements, velocities, densities considered are always such 
average values. But in the theory of light, it is desirable to have the fact clearly in 
mind on account of the two interpenetrating media (imponderable and ponderable), the 
laws of light not being in all respects the same as they would be for a single homogeneous 
medium. 

† See Lord Rayleigh’s Theory of Sound, vol. i, chapters iv, v. 

‡ It amounts essentially to the same thing, whether we regard the force as a linear 
vector function of $\mathfrak{E}$ or of $\ddot{\mathfrak{E}}$, since these differ only by the constant factor $-4\pi^2/p^2$. But 
there are some advantages in expressing the force as a function of $\ddot{\mathfrak{E}}$, because the 
greater part of the force, in the most important cases, is required to overcome the inertia 
of the ether, and is thus more immediately connected with $\ddot{\mathfrak{E}}$. 
that if $u'$ or $v'$ is discontinuous at the interface, the value of $u''$ or $v''$ 
becomes in a sense infinite, i.e., curl curl $\mathfrak{E}$, and therefore by (6) 
$\Psi\ddot{\mathfrak{E}}$, will be infinite. Now both $\mathfrak{E}$ and $\Psi$ are discontinuous at the 
interface, but infinite values for $\Psi\ddot{\mathfrak{E}}$ are not admissible. Therefore 
$u'$ and $v'$ are continuous. Again, if $u$ or $v$ is discontinuous, $u'$ or $v'$ 
will become infinite, and therefore $u''$ or $v''$. Therefore $u$ and $v$ are 
continuous. These conditions may be expressed in the most general 
manner by saying that the components of $\mathfrak{E}$ and curl $\mathfrak{E}$ parallel to the 
interface are continuous. This gives four complex scalar conditions, 
or in all eight scalar conditions, for the motion at the interface, which 
are sufficient to determine the amplitude and phase of the two 
reflected and the two refracted rays in the most general case. It is 
easy, however, to deduce from these four complex conditions, two 
others, which are interesting and sometimes convenient. It is evident 
from the definitions of $w'$ and $w''$ that if $u$, $v$, $u'$, and $v'$ are continuous 
at the interface $w'$ and $w''$ will also be continuous. Now $-Bw''$ is equal 
to the component of $\Psi\ddot{\mathfrak{E}}$ normal to the interface. The following 
quantities are therefore continuous at the interface: 

$$\left.\begin{array}{l}\text{the components parallel to the interface of } \mathfrak{E},\\ \text{the component normal to the interface of } \Psi\ddot{\mathfrak{E}},\\ \text{all components of curl } \mathfrak{E}.\end{array}\right\} \tag{7}$$

To compare these results with those derived from the electrical 
theory, we may take the general equation of monochromatic light on 
the electrical hypothesis from a paper in a former volume of this 
Journal. This equation, which with an unessential difference of notation 
may be written* 

$$-\operatorname{Pot}\,\ddot{\mathfrak{F}} - \nabla Q = 4\pi\,\Phi\mathfrak{F}, \tag{8}$$

was established by a method and considerations similar to those which 
have been used to establish equation (6), except that the ordinary law 
of electrodynamic induction had the place of the new law of elasticity. 
$\mathfrak{F}$ is a complex vector representing the electrical displacement as a 
harmonic function of the time; $\Phi$ is a complex linear vector operator, 
such that $4\pi\,\Phi\mathfrak{F}$ represents the electromotive force necessary to keep 
up the vibration $\mathfrak{F}$; $Q$ is a complex scalar representing the electrostatic 
potential, $\nabla Q$ the vector of which the three components are 
$dQ/dx$, $dQ/dy$, $dQ/dz$; 
Pot denotes the operation by which in the theory of gravitation the 
potential is calculated from the density of matter.† When it is 

* See page 218 of this volume, equation (12). 

† The symbol $-\operatorname{Pot}$ is therefore equivalent to $4\pi\nabla^{-2}$ as used by Sir William Thomson 
(with a happy economy of symbols) at the last meeting of the British Association to express 
the same law of electrodynamic induction, except that the symbol is here used as a 
vector operator. See Nature, vol. xxxviii, p. 571, sub init. 
applied as here to a vector, the three components of the result are to 
be calculated separately from the three components of the operand. 
$-\nabla Q$ is therefore the electrostatic force, and $-\operatorname{Pot}\,\ddot{\mathfrak{F}}$ the electrodynamic 
force. In establishing the equation, it was not assumed that 
the electrical motions are solenoidal, or such as to satisfy the so-called 
“equation of continuity.” We may now, however, make this assumption, 
since it is the extreme case of the electric theory which we are 
to compare with the extreme case of the elastic. 

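It results from the definitions of curl and $\nabla$ that curl $\nabla Q = 0$. We 
may therefore eliminate $Q$ from equation (8) by taking the curl. 
This gives 

$$-\operatorname{curl}\,\operatorname{Pot}\,\ddot{\mathfrak{F}} = 4\pi\,\operatorname{curl}\,\Phi\mathfrak{F}. \tag{9}$$

Since curl curl and $\frac{1}{4\pi}$ Pot are inverse operators for solenoidal vectors, 
we may get rid of the symbol Pot by taking the curl again. We 
thus get 

$$-\ddot{\mathfrak{F}} = \operatorname{curl}\,\operatorname{curl}\,\Phi\mathfrak{F}. \tag{10}$$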

The conditions for the motion at the boundary between different 
media are easily obtained from the following considerations. Pot $\ddot{\mathfrak{F}}$ 
and $Q$ are evidently continuous at the interface. Therefore the 
components parallel to the interface of $\nabla Q$, and by (8) of $\Phi\mathfrak{F}$, will be 
continuous. Again, curl Pot $\ddot{\mathfrak{F}}$ is continuous at the interface, as 
appears from the consideration that curl Pot $\dot{\mathfrak{F}}$ is the magnetic force 
due to the electrical motions $\dot{\mathfrak{F}}$. Therefore, by (9), curl $\Phi\mathfrak{F}$ is continuous. 
The solenoidal condition requires that the component of $\mathfrak{F}$ 
normal to the interface shall be continuous. 

The following quantities are therefore continuous at the interface: 

the components parallel to the interface of 4*3, ) 

the component normal to the interface of 3» [ (11) 

all components of curl ^3-j 

Of these conditions, the two relating to the normal components of 3 
and curl 4»3 are easily shown to result from the other four conditions, 
as in the analogous case in the elastic theory. 

If we now compare in the two theories the differential equations of
the motion of monochromatic light for the interior of a sensibly 
homogeneous medium, (6) and (10), and the special conditions for the 
boundary between two such media as represented by the continuity of 
the quantities (7) and (11), we find that these equations and conditions 
become identical, if 



(12) 


(13) 

7r=^-\ 

(14)

In other words, the displacements in either theory are subject to the
same general and surface conditions as the forces required to maintain 
the vibrations in an element of volume in the other theory. 

To fix our ideas in regard to the signification of $\Psi$ and $\Phi$, we may
consider the case of isotropic media, in which these operators reduce 
to ordinary algebraic quantities, simple or complex. Now the curl 
of any vector necessarily satisfies the solenoidal condition (the 
so-called “ equation of continuity ”), therefore by (C) and (S will 
be solenoidal. So also will J electrical theory. Now 

for solenoidal vectors

$$-\operatorname{curl}\operatorname{curl} = \frac{d^2}{dx^2} + \frac{d^2}{dy^2} + \frac{d^2}{dz^2}, \tag{15}$$

so that the equations (6) and (10) reduce to 


___ _ 

\dx^ dy -^ dz^J 






(16) 

(17) 


For a simple train of waves, the displacement, in either theory, may 


be represented by a constant multiplied by 


^L{jjt-^ax-y-by+cz) 

(18) 
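
The reduction (15) may be illustrated on a plane wave of the kind just written. What follows is only a sketch in Python, under the assumption (not taken from the text) that the displacement has the form amp·exp(i k·x) with a real wave-vector k; the function names are hypothetical. For such a wave curl acts as i k ×, so that −curl curl has the amplitude k × (k × amp), which agrees with the Laplacian only when amp is perpendicular to k, that is, when the vector is solenoidal.

```python
def cross(u, v):
    """Vector product of two 3-tuples."""
    return (u[1] * v[2] - u[2] * v[1],
            u[2] * v[0] - u[0] * v[2],
            u[0] * v[1] - u[1] * v[0])


def dot(u, v):
    """Scalar product of two 3-tuples."""
    return sum(a * b for a, b in zip(u, v))


def minus_curl_curl_amplitude(k, amp):
    """Amplitude of -curl curl F for F = amp * exp(i k.x).

    curl F = i k x F, hence curl curl F = -k x (k x F).
    """
    return cross(k, cross(k, amp))


def laplacian_amplitude(k, amp):
    """Amplitude of the Laplacian of F: each second derivative gives -k_j^2."""
    k2 = dot(k, k)
    return tuple(-k2 * c for c in amp)


# Assumed illustrative values.
k = (1.0, 2.0, 2.0)
transverse = (2.0, -1.0, 0.0)   # k . amp = 0, a solenoidal wave
longitudinal = k                # amp parallel to k, not solenoidal

print(minus_curl_curl_amplitude(k, transverse), laplacian_amplitude(k, transverse))
print(minus_curl_curl_amplitude(k, longitudinal), laplacian_amplitude(k, longitudinal))
```

For the transverse amplitude the two operators give the same result; for the longitudinal one −curl curl vanishes while the Laplacian does not, which is why (15) is restricted to solenoidal vectors.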

Our equations then reduce again to 



(19) 

4- -f C-) ^ 5 . 

Hence 

(20) 

xir -1 _ (b = 

(21) 


The last member of this equation, when real, evidently expresses the 
square of the velocity of light. If we set 






( 22 ) 


k denoting the velocity of light in vacuo, we have

= = (23) 


When $n^2$ is positive, which is the case of perfectly transparent
bodies, the positive root of $n^2$ is called the index of refraction of the
medium. In the most general case, it would be appropriate to call
$n$, or perhaps that root of $n^2$ of which the real part is positive, the
(complex) index of refraction, although the terminology is hardly
settled in this respect. A negative value of $n^2$ would represent a
body from which light would be totally reflected at all angles of
incidence. No such cases have been observed. Values of $n^2$ in which
the coefficient of $\iota$ is negative, indicate media in which light is
absorbed. Values in which the coefficient of $\iota$ is positive would
represent media in which the opposite phenomenon took place.*

It is no part of the object of this paper to go into the details
by which we may derive, so far as observable phenomena are concerned,
Fresnel’s law of double refraction for transparent bodies,
as well as the more general law of the same character which relates to
aeolotropic bodies of more or less opacity, and which differs from
Fresnel’s only in that certain quantities become complex, or Fresnel’s
laws for the intensities of reflected and refracted light at the boundary
of transparent isotropic media, with the more general laws for
the case of bodies aeolotropic or opaque or both. The principal cases
have already been discussed on the new elastic theory in the
Philosophical Magazine,† and a further discussion is promised. For
the electrical theory, the case of double refraction in perfectly transparent
media has been discussed quite in detail in this Journal,‡ and
the intensities of reflected and refracted light have been abundantly
deduced from the above conditions by various authors.§ So far as all
these laws are concerned, the object of this paper will be attained
if it has been made clear that the two theories, in their extreme cases,
give identical results. The greater or less degree of elegance, or
completeness, or perspicuity, with which these laws may be developed
by different authors, should weigh nothing in favor of either theory.

The non-magnetic rotation of the plane of polarization, with the 
allied phenomena in aeolotropic bodies, lie in a certain sense outside 
of the above laws, as depending on minute quantities which have been 
neglected in this discussion. The manner in which these minute 
quantities affect the equations of motion on the electrical theory has 
been shown in a former paper,‖ where these phenomena in transparent
bodies are treated quite at length. For the new theory, a
discussion of this subject is promised by Mr. Glazebrook. 

But the magnetic rotation of the plane of polarization, with the 
allied phenomena when an aeolotropic body is subjected to magnetic 
influence, fall entirely within the scope of the above equations and 
surface-conditions. The characteristic of this case is that $\Psi$ and $\Phi$
are not self-conjugate.¶ This is what we might expect on the electric

* But $\iota$ might have been introduced into the equations in such a way that a positive
coefficient in the value of $n^2$ would indicate absorption, and a negative coefficient the
impossible case.

† Sir William Thomson, loc. citat. R. T. Glazebrook, loc. citat.

‡ This vol. p. 182.

§ Lorentz, Schlömilch’s Zeitschrift, vol. xxii, pp. 1-30 and 205-219; vol. xxiii,
pp. 197-210; Fitzgerald, Phil. Trans., vol. clxxi, p. 691; J. J. Thomson, Phil.
Mag. (5), vol. ix, p. 284; Rayleigh, Phil. Mag. (5), vol. xii, p. 81. Glazebrook, Proc.
Cambr. Phil. Soc., vol. iv, p. 156.

‖ This vol. p. 195.

¶ See this vol. p. 217.

theory from the experiments of Dr. Hall, which show that the
operators expressing the relation between electromotive force and 
current are not in general self-conjugate in this case. 

In the preceding comparison, we have considered only the limiting 
cases of the two theories. With respect to the sense in which the 
limiting case is admissible, the two theories do not stand on quite 
the same footing. In the electric theory, or in any in which the 
velocity of the missing wave is very great, if we are satisfied that 
the compressibility is so small as to produce no appreciable results, 
we may set it equal to zero in our mathematical theory, even if we 
do not regard this as expressing the actual facts with absolute 
accuracy. But the case is not so simple with an elastic theory in 
which the forces resisting certain kinds of motion vanish, so far 
at least as they are proportional to the strains. The first requisite 
for any sort of optical theory is that the forces shall be proportional 
to the displacements. This is easily obtained in general by supposing 
the displacements very small. But if the resistance to one kind of 
distortion vanishes, there will be a tendency for this kind of distortion 
to appear at some places in an exaggerated form, and even to an 
infinite degree, however small the displacements may be in other 
parts of the field. In the case before us, if we suppose the velocity 
of the missing wave to be absolutely zero, there will be infinite 
condensations and rarefactions at a surface where ordinary waves 
are reflected. That is, a certain volume of ether will be condensed
to a surface, and vice versa. This prevents any treatment of the
extreme case, which is at once simple and satisfactory. The difficulty
has been noticed by Sir William Thomson, who observes that it 
may be avoided if we suppose the displacements infinitely small in 
comparison with the wave-length of the wave of compression. This 
implies a finite velocity for that wave. A similar difficulty would 
probably be found to exist (in the extreme case) with regard to 
the deformation of the ether by the molecules of ponderable matter, 
as the ether oscillates among them. If the statical resistance to 
irrotational motions is zero, it is not at all evident that the statical 
forces evoked by the disturbance caused by the molecules would be 
proportional to the motions. But this difficulty would be obviated by 
the same hypothesis as the first. 

These circumstances render the elastic theory somewhat less con¬ 
venient as a working hypothesis than the electric. They do not 
necessarily involve any complication of the equations of optics. For 
it may still be possible that this velocity of the missing wave is
so small that the quantities on which it depends may be set equal
to zero in the equations which represent the phenomena of optics.
But the mental processes by which we satisfy ourselves of the
validity of our results (if we do not work out the whole problem 
in the general case of no assumption in regard to the velocity of 
the missing wave) certainly involve conceptions of a higher degree 
of difficulty on account of the circumstances mentioned. Perhaps 
this ought not to affect our judgment with respect to the question 
of the truth of the hypothesis. 

Although the two theories give laws of exactly the same form 
for monochromatic light in the limiting case, their deviations from 
this limit are in opposite directions, so that if the phenomena of 
optics differed in any marked degree from what we would have in 
the limiting case, it would be easy to find an experimentum crucis 
to decide between the two theories. A little consideration will make 
it evident, that when the principal indices of refraction of a crystal are 
given, the intermediate values for oblique wave-planes will be less if 
the velocity of the missing wave is small but finite, than if it is infini¬ 
tesimal, and will be greater if the velocity of the missing wave is very 
great than if it is infinite.* Hence, if the velocity of the missing 
wave is small but finite, the intermediate values of the indices of 
refraction will be less than are given by Fresnel’s law, but if the 
velocity of the missing wave is very great but finite, the intermediate 
values of the indices of refraction will be greater than are given by 
Fresnel’s law. But the recent experiments of Professor Hastings on 
the law of double refraction in Iceland spar do not encourage us to 
look in this direction for the decision of the question.†

In a simple train of waves in a transparent medium, the potential 
energy, on the elastic theory, may be divided into two parts, of which 
one is due to that general deformation of the ether which is repre¬ 
sented by the equations of wave-motion, and the other to those 
deformations which are caused by the interference of the ponderable 
particles with the wave-motion, and to such displacements of the 
ponderable matter as may be caused, in some cases at least, by the 
motion of the ether. If we write h for the amplitude, l for the wave-length,
and p for the period, these two parts of the statical energy
(estimated per unit of volume for a space including many wave-lengths)
may be represented respectively by

$$\frac{\pi^2 B h^2}{l^2} \quad\text{and}\quad \frac{b h^2}{4}.$$

* This may be more clear if we consider the stationary waves formed by two trains of 
waves moving in opposite directions. The case then comes under the following theorem: 

“If the system undergo such a change that the potential energy of a given configuration
is diminished, while the kinetic energy of a given motion is unaltered, the periods
of the free vibrations are all increased, and conversely.” See Lord Rayleigh’s Theory of
Sound, vol. i, p. 85.

† Am. Jour. Sci., ser. 3, vol. xxxv, p. 60.

The sum of these may be equated to the kinetic energy, giving an
equation of the form

$$\frac{\pi^2 B h^2}{l^2} + \frac{b h^2}{4} = \frac{\pi^2 A' h^2}{p^2}. \tag{24}$$

B is an absolute constant (the rigidity of the ether, previously
represented by the same letter), A' and b will be constant (for the
same medium and the same direction of the wave-normal) except
so far as the type of the motion changes, i.e., except so far as the
manner in which the motion of the ether distributes itself between
the ponderable molecules, and the degree in which these take part
in the motion, may undergo a change. When the period of vibration
varies, the type of motion will vary more or less, and A' and b will
vary more or less.

In a manner entirely analogous,* the kinetic energy, on the electrical
theory, may be divided into two parts, of which one is due to
those general fluxes which are represented by the equations of wave-motion,
and the other to those irregularities in the fluxes which
are caused by the presence of the ponderable molecules, as well as to
such motions of the ponderable particles themselves as may sometimes
occur. These parts of the kinetic energy may be represented
respectively by

$$\frac{\pi F l^2 h^2}{p^2} \quad\text{and}\quad \frac{f h^2}{p^2}.$$

Their sum equated to the potential energy gives

$$\frac{\pi F l^2 h^2}{p^2} + \frac{f h^2}{p^2} = \frac{G h^2}{4}. \tag{25}$$

Here F is the constant of electrodynamic induction, which is unity if
we use the electromagnetic system of units; f and G (like A' and b)
vary only so far as the type of motion varies.

We have the means of forming a very exact numerical estimate 
of the ratio of the two parts into which the statical energy is thus 
divided on the elastic theory, or the kinetic energy on the electric 
theory. The means for this estimate is afforded by the principle 
that the period of a natural vibration is stationary when its type is 
infinitesimally altered by any constraint.† Let us consider a case of
simple wave-motion, and suppose the period to be infinitesimally
varied: the wave-length will also vary, and presumably to some
extent the type of vibration. But, by the principle just stated, if the
ether or the electricity could be constrained to vibrate in the original
type, the variations of l and p would be the same as in the actual


* See page 182 of this volume.

† See Lord Rayleigh’s Theory of Sound, vol. i, p. 84. The application of the principle
is most simple in the case of stationary waves.

case. Therefore, in finding the differential equation between l and p,
we may treat b and A' in (24) and f and G in (25) as constant, as well
as B and F. These equations may be written

$$4\pi^2 B\,\frac{p^2}{l^2} + b p^2 = 4\pi^2 A', \qquad 4\pi F\,\frac{l^2}{p^2} + \frac{4f}{p^2} = G.$$

Differentiating, we get

$$4\pi^2 B\, d\!\left(\frac{p^2}{l^2}\right) = -b\, d(p^2), \qquad \pi F\, d\!\left(\frac{l^2}{p^2}\right) = -f\, d(p^{-2}),$$

or

$$4\pi^2 B\,\frac{p^2}{l^2}\, d\log\frac{p^2}{l^2} = -b p^2\, d\log p^2, \qquad \pi F\,\frac{l^2}{p^2}\, d\log\frac{l^2}{p^2} = -\frac{f}{p^2}\, d\log p^{-2}.$$

Hence, if we write V for the wave-velocity (l/p), n for the index of
refraction, and λ for the wave-length in vacuo, we have for the ratio
of the two parts into which we have divided the potential energy on
the elastic theory,

$$\frac{b h^2}{4} : \frac{\pi^2 B h^2}{l^2} = \frac{d\log V}{d\log p} = -\frac{d\log n}{d\log\lambda}, \tag{26}$$

and for the ratio of the two parts into which we have divided the
kinetic energy on the electrical theory,

$$\frac{f h^2}{p^2} : \frac{\pi F l^2 h^2}{p^2} = \frac{d\log V}{d\log p} = -\frac{d\log n}{d\log\lambda}. \tag{27}$$

It is interesting to see that these ratios have the same value. This 
value may be expressed in another form, which is suggestive of some 
important relations. If we write U for what Lord Rayleigh has 
called the velocity of a group of waves,* 


$$\frac{U}{V} = 1 - \frac{d\log V}{d\log l}, \qquad \frac{d\log V}{d\log l} = \frac{V-U}{V}, \qquad \frac{d\log V}{d\log p} = \frac{V-U}{U}. \tag{28}$$


It appears, therefore, that in the elastic theory that part of the 
potential energy which depends on the deformation expressed 


* See his “Note on Progressive Waves,” Proc. Lond. Math. Soc., vol. ix, No. 125,
reprinted in his Theory of Sound, vol. ii, p. 297.

by the equations of wave-motion, bears to the whole potential
energy the same ratio which the velocity of a group of waves
bears to the wave-velocity. In the electrical theory, that part of 
the kinetic energy which depends on the motions expressed by the 
equations of wave-motion bears to the whole kinetic energy the same 
ratio. 
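
The equality of the ratio in (26) and (27) with (V−U)/U, and the resulting fraction U/V, can be checked numerically. The following is a sketch only: the dispersion law `n_of` and its coefficients are assumptions chosen for illustration (a Cauchy-type formula), not data from the text, and the function names are hypothetical. The group velocity is taken as U = d(1/p)/d(1/l), with 1/p = k/λ and 1/l = n/λ.

```python
import math

K = 1.0  # velocity of light in vacuo, in arbitrary units (assumed)


def n_of(lam):
    """Hypothetical index of refraction as a function of the vacuum wave-length."""
    return 1.60 + 0.02 / lam ** 2


def dlogn_dloglam(lam, eps=1e-6):
    """Central-difference estimate of d log n / d log lambda."""
    lo, hi = lam * (1 - eps), lam * (1 + eps)
    return (math.log(n_of(hi)) - math.log(n_of(lo))) / (math.log(hi) - math.log(lo))


def group_velocity(lam, eps=1e-6):
    """U = d(1/p) / d(1/l), with 1/p = K/lambda and 1/l = n/lambda."""
    lo, hi = lam * (1 - eps), lam * (1 + eps)
    return (K / hi - K / lo) / (n_of(hi) / hi - n_of(lo) / lo)


lam = 0.589                       # assumed wave-length, arbitrary units
V = K / n_of(lam)                 # wave-velocity
U = group_velocity(lam)           # velocity of a group of waves
ratio = -dlogn_dloglam(lam)       # last member of (26) and (27)

print(ratio, (V - U) / U)         # the two agree
print(U / V, 1 / (1 + ratio))     # share of the energy expressed by the wave equations
```

With any smooth dispersion law the two printed pairs should agree to within the error of the finite differences.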

Returning to the consideration of equations (26) and (27), we 
observe that in transparent bodies the last member of these equations 
represents a quantity which is small compared with unity, at least in 
the visible spectrum, and diminishes rapidly as the wave-length 
increases. This is just what we should expect of the first member of 
equation (27). But when we pass to equation (26), which relates to 
the elastic theory, the case is entirely different. The fact that the 
kinetic energy is affected by the presence of the ponderable matter 
and affected differently in different directions, shows that the motion 
of the ether is considerably modified. This implies a distortion superposed
upon the distortion represented by the equations of wave-motion,
and very much greater, since the body is very fine-grained as
measured by a wave-length. With any other law of elasticity, we 
should suppose that the energy of this superposed distortion would 
enormously exceed that of the regular distortion represented by the 
equations of wave-motion. But it is the peculiarity of this new law 
of elasticity that there is one kind of distortion, of which the energy 
is very small, and which is therefore peculiarly likely to occur. Now 
if we can suppose the distortion caused by the ponderable molecules 
to be almost entirely of this kind, we may be able to account for the 
smallness of its energy. We should still expect the first member of
(26) to increase with the wave-length, on account of the factor $l^2$,
instead of diminishing, as the last member of the equation shows that
it does. We are obliged to suppose that b, and therefore the type of
the vibrations, varies very rapidly with the wave-length, even in those 
cases which appear farthest removed from anything like selective 
absorption. 

The electrical theory furnishes a relation between the refractive
power of a body and its specific dielectric capacity, which is commonly
expressed by saying that the latter is equal to the square of
the index of refraction for waves of infinite length. No objection can
be made to this statement, but the great uncertainty in determining
the index for waves of infinite length by extrapolation prevents it
from furnishing any very rigorous test of the theory. Yet, as the
results of extrapolation in some cases agree strikingly with the
specific dielectric capacity, although in other cases they are quite
different, the correspondence is generally regarded as corroborative, in
some degree, of the theory. But the relation between refractive power
and dielectric capacity may be expressed in a form which will furnish
a more rigorous test, as not involving extrapolation.

We have seen on page 242 how we may determine numerically the
ratio of the two first terms of equation (25). We thus easily get
the ratio of the first and last term, which gives

$$\frac{\pi F l^2 h^2}{p^2}\left(1 - \frac{d\log n}{d\log\lambda}\right) = \frac{G h^2}{4}. \tag{29}$$

In the corresponding equation for a train of waves of the same amplitude
and period in vacuo, l becomes λ, F remains the same, and for G
we may write G'. This gives

$$\frac{\pi F \lambda^2 h^2}{p^2} = \frac{G' h^2}{4}. \tag{30}$$

Dividing, we get

$$\frac{G}{G'} = \frac{l^2}{\lambda^2}\left(1 - \frac{d\log n}{d\log\lambda}\right) = \frac{d(l^2)}{d(\lambda^2)}. \tag{31}$$

Now G' is the dielectric elasticity of pure ether. If K is the
specific dielectric capacity of the body which we are considering,
G'/K is the dielectric elasticity of the body and G'/2K is the potential
energy of the body (per unit of volume), due to a unit of ordinary
electrostatic displacement. But $Gh^2/4$ is the potential energy in a
train of waves of amplitude h. Since the average square of the displacement
is $h^2/2$, the potential energy of a unit displacement such as
occurs in a train of waves is G/2. Now in the electrostatic experiment
the displacement distributes itself among the molecules so as to
make the energy a minimum. But in the case of light the distribution
of the displacement is not determined entirely by statical
considerations. Hence

$$\frac{G}{2} \geq \frac{G'}{2K}, \tag{32}$$

and

$$K \geq \frac{d(\lambda^2)}{d(l^2)}. \tag{33}$$

It is to be observed that if we should assume for a dispersion-formula

$$n^{-2} = a - b\lambda^{-2}, \tag{34}$$

1/a, which is the square of the index of refraction for an infinite
wave-length, would be identical with the second member of (33).
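
The closing remark on the dispersion-formula can be verified by a short computation. The sketch below assumes illustrative values of the constants a and b (they are not taken from any measurement) and uses hypothetical function names. Since l = λ/n, the formula gives l² = aλ² − b, so that d(λ²)/d(l²) should equal 1/a at every wave-length.

```python
A_CONST = 0.40    # assumed value of a in n^-2 = a - b * lambda^-2
B_CONST = 0.004   # assumed value of b


def inverse_n_squared(lam):
    """Dispersion-formula n^-2 = a - b * lambda^-2."""
    return A_CONST - B_CONST / lam ** 2


def l_squared(lam):
    """Square of the wave-length in the medium, l = lambda / n."""
    return lam ** 2 * inverse_n_squared(lam)


def dlam2_dl2(lam, eps=1e-6):
    """Central-difference estimate of d(lambda^2) / d(l^2)."""
    lo, hi = lam * (1 - eps), lam * (1 + eps)
    return (hi ** 2 - lo ** 2) / (l_squared(hi) - l_squared(lo))


for lam in (0.4, 0.589, 0.76):
    print(lam, dlam2_dl2(lam), 1 / A_CONST)
```

The estimate does not depend on the wave-length, so that under this formula the test requires no extrapolation to infinite wave-lengths.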

Another similarity between the electrical and optical properties of 
bodies consists in the relation between conductivity and opacity. 
Bodies in which electrical fluxes are attended with absorption of 
energy absorb likewise the energy of the motions which constitute
light. This is strikingly true of the metals. But the analogy does
not stop here. To fix our ideas, let us consider the case of an 
isotropic body and circularly polarized light, which is geometrically 
the simplest case although its analytical expression is not so simple 
as that of plane-polarized light. The displacement at any point 
may be symbolized by the rotation of a point in a circle. The 
external force necessary to maintain the displacement $\mathfrak{F}$ is represented
by $4\pi\Phi\mathfrak{F}$. In transparent bodies, for which $\Phi$ is a positive number,
the force is radial and in the direction of the displacement, being
principally employed in counterbalancing the dielectric elasticity,
which tends to diminish the displacement. In a conductor $\Phi$
becomes complex, which indicates a component of the force in the
direction of $\dot{\mathfrak{F}}$, that is, tangential to the circle. This is only the
analytical expression of the fact above mentioned. But there is
another optical peculiarity of metals, which has caused much remark,
viz., that the real part of $n^2$ (and therefore of $n^{-2}$) is negative, i.e.,
the radial component of the force is directed towards the center.
This inwardly directed force, which evidently opposes the electro- 
dynamic induction of the irregular part of the motion, is small 
compared with the outward force which is found in transparent 
bodies, but increases rapidly as the period diminishes. We may say, 
therefore, that metals exhibit a second optical peculiarity,—that the 
dielectric elasticity is not prominent as in transparent bodies. This 
is like the electrical behavior of the metals, in which we do not 
observe any elastic resistance to the motion of electricity. We see, 
therefore, that the complex indices of metals, both in the real and 
the imaginary part of their inverse squares, exhibit properties 
corresponding to the electrical behavior of the metals. 
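
The statement that a negative real part of n² carries with it a negative real part of n⁻² follows from 1/z = z̄/|z|². It may be illustrated as below; the numerical value of n² is an assumption chosen only to have the signs described for metals, not a measured index, and the function name is hypothetical.

```python
import cmath


def complex_index(n_squared):
    """Return the root of n^2 whose real part is not negative."""
    # cmath.sqrt gives the principal square root, which lies in the right half-plane.
    return cmath.sqrt(n_squared)


# Illustrative value only: negative real part and negative coefficient of the
# imaginary unit, as the text describes for an absorbing metal.
n_squared_metal = complex(-10.0, -3.0)

n = complex_index(n_squared_metal)
inverse_square = 1 / n_squared_metal

print(n)               # root with non-negative real part
print(inverse_square)  # its real part has the same sign as that of n^2
```

The real parts of n² and n⁻² always have the same sign, while the coefficients of the imaginary unit have opposite signs.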

The case is quite different in the elastic theory. Here the force 
from outside necessary to maintain in any element of volume the 
displacement $\mathfrak{E}$ is represented by $\Psi\mathfrak{E}$. In transparent bodies,
therefore, it is directed toward the center. In metals, there is a
component in the direction of the motion $\dot{\mathfrak{E}}$, while the radial part of
the force changes its direction and is often many times greater than 
the opposite force in transparent bodies. This indicates that in 
metals the displacement of the ether is resisted by a strong elastic 
force, quite enormous compared to anything of the kind in trans¬ 
parent bodies, where it indeed exists, but is so small that it has been 
neglected by most writers except when treating of dispersion. We 
can make these suppositions, but they do not correspond to 
anything which we know independently of optical experiment. 

It is evident that the electrical theory of light has a serious rival, 
in a sense in which, perhaps, one did not exist before the publication
of Sir William Thomson’s paper in November last.* Nevertheless,
neither surprise at the results which have been achieved, nor 
admiration for that happy audacity of genius, which, seeking the 
solution of the problem precisely where no one else would have 
ventured to look for it, has turned half a century of defeat into 
victory, should blind us to the actual state of the question. 

It may still be said for the electrical theory, that it is not obliged 
to invent hypotheses,† but only to apply the laws furnished by
the science of electricity, and that it is difficult to account for the
coincidences between the electrical and optical properties of media, 
unless we regard the motions of light as electrical. But if the 
electrical character of light is conceded, the optical problem is very 
different from anything which existed in the time of Fresnel, Cauchy, 
and Green. The third wave, for example, is no longer something 
to be gotten rid of quocunque modo, but something which we must 
dispose of in accordance with the laws of electricity. This would 
seem to rule out the possibility of a relatively small velocity for 
the third wave. 


* “Since the first publication of Cauchy’s work on the subject in 1830, and of Green’s
in 1837, many attempts have been made by many workers to find a dynamical
foundation for Fresnel’s laws of reflexion and refraction of light, but all hitherto
ineffectually.” Sir William Thomson, loc. citat.

“So far as I am aware, the electric theory of Maxwell is the only one satisfying those
conditions (of explaining at once Fresnel’s laws of double refraction in crystals and
those governing the intensity of reflexion when light passes from one isotropic medium
to another).” Lord Rayleigh, Phil. Mag., September, 1888.

† Electrical motions in air, since the recent experiments of Professor Hertz, seem to
be no longer a matter of hypothesis. We can hardly suppose that the case is essentially
different with the so-called vacuum. The theorem that the electrical motions of light
are solenoidal, although it is convenient to assume it as a hypothesis and show that the
results agree with experiment, need not occupy any such fundamental position in the
theory. It is in fact only another way of saying that two of the constants of electrical
science have a certain ratio (infinity). It would be easy to commence without assuming
this value, and to show in the course of the development of the subject that experiment
requires it, not of course as an abstract proposition, but in the sense in which
experiment can be said to require any values of any constants, that is, to a certain
degree of approximation.

REVIEWS OF NEWCOMB AND MICHELSON’S “VELOCITY
OF LIGHT IN AIR AND REFRACTING MEDIA” AND
OF KETTELER’S “THEORETISCHE OPTIK.”

[American Journal of Science, ser. 3, vol. xxxi. pp. 62-67, Jan. 1886.]

Velocity of Light in Air and Refracting Media.

Astronomical Papers prepared for the use of the American Ephemeris and
Nautical Almanac, vol. ii. parts 3 and 4,* Washington, 1885.

Professor Newcomb obtains as the final result of his experiments
at Washington 299,860 ± 30 kilometers per second for the velocity of
light in vacuo. Professor Michelson’s entirely independent experiments
at Cleveland give substantially the same result (299,853 ± 60).
His former experiments at the Naval Academy, after correction of
two small errors which he now reports, give 299,910 ± 50. All these
experiments were made with the revolving mirror, but the arrangements
of the two experimenters were in other respects radically
different. The first of these values of the velocity of light with
Nyrén’s value of the constant of aberration (20″.492) gives 149.60 for
the distance of the sun in millions of kilometers. On account of the
recent announcement by Messrs. Young and Forbes of a difference
of about two per cent. in the velocities of red and blue light,
especial attention was paid to this point by both experimenters,
without finding the least indication of any difference. In Professor
Newcomb’s experiments, a difference of only one thousandth in these 
velocities would have produced a well-marked iridescence on the 
edges of the return image of the slit formed by reflection from the 
revolving mirror. No trace of such iridescence could ever be seen. 
Professor Michelson made an experiment, in which a red glass covered 
one-half the slit. The two halves of the image—the upper white, 
the lower red—were exactly in line. 
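
The passage from the velocity of light and the constant of aberration to the distance of the sun may be sketched as follows. The review does not state the reduction actually used; the formula below is the usual one for an elliptic orbit, and the length of the sidereal year and the eccentricity are assumed modern values, so that agreement with the printed 149.60 can be expected only to within a few units of the last place. The function name is hypothetical.

```python
import math

ARCSEC_PER_RADIAN = 180 * 3600 / math.pi


def solar_distance_km(c_km_s, aberration_arcsec,
                      year_s=365.25636 * 86400,   # sidereal year (assumed value)
                      eccentricity=0.01675):      # orbital eccentricity (assumed value)
    """Mean distance of the sun from the velocity of light and the aberration constant.

    The constant of aberration is kappa = 2*pi*a / (T * c * sqrt(1 - e^2)),
    which is solved here for the mean distance a.
    """
    kappa = aberration_arcsec / ARCSEC_PER_RADIAN
    return kappa * c_km_s * year_s * math.sqrt(1 - eccentricity ** 2) / (2 * math.pi)


distance = solar_distance_km(299860, 20.492)
print(distance / 1e6)   # millions of kilometers
```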

Since Maxwell's electromagnetic theory of light makes the velocity 
of light in air equal to the ratio of the electromagnetic and 

* [Part 3, “Measures of the Velocity of Light,” S. Newcomb; part 4, “Supplementary
Measures of the Velocities of white and colored Light in air, water,
and carbon disulphide,” A. A. Michelson.]

electrostatic units of electricity, it will be interesting to compare
some recent determinations of this ratio. These we give in the
following table. Since the determinations are affected by any error
in the standard of resistance, we have corrected the results, first, on
the supposition that the B.A. ohm = .987 true ohms (Lord Rayleigh’s
result), and secondly, on the supposition that the B.A. ohm = .989 true
ohms, which is essentially assuming that the legal ohm represents the
true value.

Ratio of Electromagnetic and Electrostatic units of Electricity in millions of
meters and seconds.

                     Date.   As published.   B.A. ohm = .987.   B.A. ohm = .989.
Ayrton & Perry,*     1878    298.0           296.1              296.4
Hockin,†             1879    298.8           296.9              297.2
Shida,‡              1880    299.5           295.6              296.2
Exner,§              1882    301.1 (?)       291.7 (?)          292.3 (?)
J. J. Thomson,‖      1883    296.3           296.3              296.9
Klemenčič,¶          1884    301.88 (?)      301.88 (?)         302.48 (?)



These numbers are to be compared with the velocity of light in air, in
millions of meters per second, for which Professor Newcomb gives
299.778. Of the electrical determinations, that of J. J. Thomson
appears by far the most worthy of confidence. That of Klemenčič,
the only one as great as the velocity of light, was obtained by the
use of a condenser with glass, a method which would presumably
give too great a ratio. Exner’s value is obtained from the mean of
three determinations, one of which differed from the others by about
three per cent. If we reject this discordant determination, the mean
of the other two would give when corrected for resistance 294.4 and
295.0. If we set aside the determinations of Exner and Klemenčič,
the remaining four, which represent three different methods, are very
accordant, the mean being nearly identical with the result of J. J.
Thomson, and about one per cent. less than the velocity of light.
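
The comparison just made can be repeated in a few lines. The sketch takes the four retained determinations from the column of the table corrected on the supposition that the B.A. ohm = .987 (the choice of that column is an assumption of this example), and compares their mean with Professor Newcomb’s velocity of light in air.

```python
# Values in millions of meters per second, B.A. ohm = .987 column.
retained = {
    "Ayrton & Perry": 296.1,
    "Hockin": 296.9,
    "Shida": 295.6,
    "J. J. Thomson": 296.3,
}
light_in_air = 299.778   # Newcomb, millions of meters per second

mean_ratio = sum(retained.values()) / len(retained)
deficit_percent = 100 * (light_in_air - mean_ratio) / light_in_air

print(round(mean_ratio, 3))        # close to J. J. Thomson's 296.3
print(round(deficit_percent, 2))   # about one per cent.
```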

Professor Michelson’s experiments on the velocity of light in carbon
disulphide afford an interesting illustration of the difference between
the velocity of waves and the velocity of groups of waves, a subject
which is treated at length in an appendix to the second volume of
Lord Rayleigh’s Theory of Sound. If we write V for the velocity
of waves, U for that of a group of waves, L for the wave-length, and
T for the period of vibration,

$$V = \frac{L}{T}, \qquad U = \frac{d(T^{-1})}{d(L^{-1})}.$$

For purposes of numerical calculation, it will be convenient to 
transform these formulae by the use of X for the wave-length in 


* Phil. Mag.., (5), vol, vii, p. 277. Report Brit. Assoc., 1879, p. 28/). 

X Phil. Mag., (5), vol. x, p. 431. %Sitzungsl>erichte Wien. Alxid., vol. Ixxxvi, p. 106. 

\\Phil. Trans., vol. olxxiv, p. 707. ^ Sitzungeherichfe Wien. Akad., vol. Ixxxix, p. 298. 
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vacwj, n for the index of refraction of the medium considered, and k 
for the velocity of light in vaeno, which we shall regard as constant, 
in accordance with general usage. By substitution of these letters we 
easily obtain 

_ k ^d{n\~^) 

The data for the calculation of these quantities for carbon disulphide 
are given by Verdet {Annale^ de Chimif^ el de Pltysique, (3), vol. Ixix, 
p. 470). They give 

for the line D, kjY '624, hjV) — 1 *722, 
for the line E, kjY = 1 '637, /.yU = 1‘767. 

The quotient of the velocity in vacuo divided by the velocity in 
carbon disulphide, according to Professor Michelson’s experiments 
with the light of an arc lamp, is 1.76 ± 0.02, which agrees very well 
with k/U. Another theory, which would make the velocity observed 
in such experiments $V^2/U$ (Nature, vol. xxv, p. 52), receives no 
countenance from these experiments. The value of $kU/V^2$ would 
be about 1.53. Some may think that the experiments on water point 
in a different direction. Taking our data from Beer’s Einleitung in 
die höhere Optik, 1853, p. 411, we get 

for D, k/V = 1.334, k/U = 1.352, $kU/V^2$ = 1.316, 
for E, k/V = 1.336, k/U = 1.359, $kU/V^2$ = 1.313. 

The number obtained by experiment was 1.330, which agrees better 
with k/V, or even with $kU/V^2$, than with k/U, but the differences are 
here too small to have much significance. 
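
The comparison above is simple arithmetic on the tabulated ratios, since $kU/V^2 = (k/V)^2/(k/U)$. A minimal sketch of that check follows; the function name is ours, and the numbers are only the ratios quoted from Verdet and Beer.

```python
def k_u_over_v_squared(k_over_v, k_over_u):
    """Return kU/V^2 computed from the two ratios k/V and k/U."""
    return k_over_v ** 2 / k_over_u


# Ratios quoted in the text: (k/V, k/U) for the spectral lines D and E.
carbon_disulphide = {"D": (1.624, 1.722), "E": (1.637, 1.767)}
water = {"D": (1.334, 1.352), "E": (1.336, 1.359)}

for name, table in (("carbon disulphide", carbon_disulphide), ("water", water)):
    for line, (k_v, k_u) in table.items():
        print(name, line, round(k_u_over_v_squared(k_v, k_u), 3))
```

For water this reproduces the tabulated 1.316 and 1.313, and for the line D in carbon disulphide it gives about 1.53, well outside the 1.76 ± 0.02 observed.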


Theoretische Optik, gegründet auf das Bessel-Sellmeier’sche 
Princip, zugleich mit den experimentellen Belegen. 

Von Dr. E. Ketteler, Professor an der Universität in Bonn. Vieweg 
und Sohn. Braunschweig, 1885. 

The principle of Sellmeier, here referred to, relates to vibrations of 
ponderable particles excited by the etherial vibrations of light, and to 
the reaction of the former upon the latter. The name of Bessel is 
added on account of his previous solution of a somewhat analogous 
problem relating to the pendulum. The object of this work is “to 
treat theoretical optics in a complete and uniform manner on the new 
foundation of the simultaneous vibration of etherial and ponderable 
particles, and to substitute a consistent and systematic new structure 
for the present conglomerate of more or less disconnected principles.” 
Such a work demands a critical examination, which should not be 
undertaken from any narrow point of view. Any faults of detail will 
be readily forgiven, if the author shall give the theory of optics 
the ποῦ στῶ which it has sought so long in vain. We may add that 
if this effort shall not be judged successful by the scientific world, 
the author will at least have the satisfaction of being associated in his 
failure with many of the most distinguished names in mathematical 
physics. 

We have sought to test the proposed theory with respect to that 
law of optics which seems most conspicuous in its definite mathematical 
form, and in the rigor of the experimental verifications to 
which it has been subjected, as well as in the magnificent developments 
to which it has given rise: the law of double refraction due to 
Huyghens and Fresnel, and geometrically illustrated by the wave-surface 
of the latter. We cannot find that the law of Fresnel is 
proved at all in this treatise. We find on the contrary, that a law is 
deduced which is different from Fresnel’s, and inconsistent with it. 
We do not refer to anything relating to the direction of vibration of 
the rays in a crystal, which is a point not touched by the experimental 
verifications to which we have alluded. We shall confine 
our comparison to those equations from which the direction of 
vibration has been eliminated, and which therefore represent relations 
subject to experimental control. For this purpose equation (13) on 
page 299 is suitable. It reads 


$$\frac{u^2}{n_x^2 - n^2} + \frac{v^2}{n_y^2 - n^2} + \frac{w^2}{n_z^2 - n^2} = 0,$$

$n_x$, $n_y$, $n_z$ being the principal indices of refraction. This the author 
calls the equation of the wave-surface or surface of ray-velocities. 
It has the form of the equation of Fresnel’s wave-surface, expressed 
in terms of the direction-cosines and reciprocal of the radius vector, 
and if u, v, w are the direction-cosines of the ray, and n the velocity 
of light in vacuo divided by the so-called ray-velocity in the crystal, 
the equation will express Fresnel’s law. But it is impossible to give 
these meanings to u, v, w and n. They are introduced into the 
discussion in the expression for the vibrations (p. 295), viz., 




The form of this equation shows that u, v, w are proportional to the 
direction-cosines of the wave-normal, and as the relation 
$u^2 + v^2 + w^2 = 1$ 
is afterwards used, they must be the direction-cosines of the wave-normal. 
They cannot possibly denote the direction-cosines of the ray, 
except in the particular case in which the ray and wave-normal 
coincide. Again, from the form of this equation, λ/n must be the 
wave-length in the crystal, and if λ here as elsewhere in the book 
(see p. 26) denotes the wave-length in vacuo of light of the period 
considered, which we doubt not is the intention of the author, n must 
be the wave-length in vacuo divided by the wave-length in the 
crystal, i.e., the velocity of light in vacuo divided by the wave-velocity 
in the crystal. With these definitions of u, v, w, and n, equation (13) 
expresses a law which is different from Fresnel’s. Applied to the 
simple case of a uniaxial crystal, it makes the relation between the 
wave-velocity of the extraordinary ray and the angle of the wave-normal 
with the principal axis the same as that of the radius vector 
and the angle in an ellipse. The law of Huyghens and Fresnel makes 
the reciprocal of the wave-velocity stand in this relation. 
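
The two laws for a uniaxial crystal can be put side by side numerically. The sketch below is only an illustration: velocities are taken in units of the velocity in vacuo, the function names are ours, and the indices 1.658 and 1.486 are assumed round values of the kind met with in Iceland spar, not data from the treatise under review.

```python
import math


def wave_velocity_fresnel(theta, n_o, n_e):
    """Huyghens-Fresnel law: 1/V is the radius vector of an ellipse,
    so that V^2 = V_o^2 cos^2(theta) + V_e^2 sin^2(theta)."""
    v_o, v_e = 1.0 / n_o, 1.0 / n_e
    return math.sqrt((v_o * math.cos(theta)) ** 2 + (v_e * math.sin(theta)) ** 2)


def wave_velocity_eq13(theta, n_o, n_e):
    """Equation (13) read with wave-normal cosines: V itself is the radius
    vector of an ellipse, so that n^2 = n_o^2 cos^2(theta) + n_e^2 sin^2(theta)."""
    n = math.sqrt((n_o * math.cos(theta)) ** 2 + (n_e * math.sin(theta)) ** 2)
    return 1.0 / n


N_O, N_E = 1.658, 1.486  # assumed illustrative indices
for degrees in (0, 30, 45, 60, 90):
    theta = math.radians(degrees)
    print(degrees,
          round(wave_velocity_fresnel(theta, N_O, N_E), 5),
          round(wave_velocity_eq13(theta, N_O, N_E), 5))
```

The two laws agree along the axis and at right angles to it, and differ at intermediate angles by a few parts in a thousand for indices of this size, which is why the question could only be settled by careful measurement.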

The law which our author has deduced has come up again and 
again in the history of theoretical optics. Professor Stokes (Report 
of the British Assoc., 1862, part i, p. 269) and Lord Rayleigh (Phil. 
Mag., (4), vol. xli, p. 525) have both raised the question whether 
Huyghens and Fresnel might not have been wrong, and it might not 
be the wave-velocity and not its reciprocal which is represented by 
the radius vector in an ellipse. The difference is not very great, for 
if we lay off on the radii vectores of an ellipse distances inversely 
proportional to their lengths, the resultant figure will have an oval 
form approaching that of an ellipse when the eccentricity of the 
original ellipse is small. Rankine appears to have thought that the 
difference might be neglected (see Phil. Mag., (4), vol. i, pp. 444, 445); 
at least he claims that his theory leads to Fresnel’s law, while really 
it would give the same law which our author has found. (Concerning 
Rankine’s “splendid failure,” and the whole history of the subject, see 
Sir Wm. Thomson’s Lectures on Molecular Dynamics at the Johns 
Hopkins University, chap. xx.) Professor Stokes undertook experiments 
to decide the question. His result, corroborated by Glazebrook 
(Proc. Roy. Soc., vol. xx, p. 443; Phil. Trans., vol. clxxi, p. 421), was 
that Huyghens and Fresnel were right and that the other law was 
wrong. 

To return to our author; we have no doubt from the context that 
he regards u, v, w, and n as relating to the ray and not to the wave- 
normal. We suppose that that is the meaning of his remark that 
the expression for the vibrations (quoted above) is to be referred to 
the direction of the ray. It seems rather hard not to allow a writer 
the privilege of defining his own terms. Yet the reader will admit 
that when the vibrations have been expressed in the above form an 
inexorable necessity fixes the significance of the direction determined 
by u, v, w, and leaves nothing in that respect to the choice of the 
author. 

The historical sketches of the development of ideas in the theory 
of optics, enriched by very numerous references, will be useful to 
the student. An exception, however, must be made with respect to 
the statements concerning the electromagnetic theory of light. We 
are told (p. 450) that the English theory, founded by Maxwell and 
represented by Glazebrook and Fitzgerald, makes the plane of polarization 
coincide with the plane of vibration, while Lorentz, on the 
basis of Helmholtz’s equations, comes to the conclusion that these 
planes are at right angles. Since all these writers make the electrical 
displacement perpendicular to the plane of polarization, we can only 
attribute this statement to some confusion between the electrical 
displacement and the magnetic force or “displacement” at right angles 
to it. We are also told that Glazebrook’s “surface-conditions” which 
determine the intensity of reflected and refracted light are different 
from those of Lorentz, a singular error in view of the fact that 
Mr. Glazebrook (Proc. Camb. Phil. Soc., vol. iv, p. 166) expressly 
states that his results are the same as those of Lorentz, Fitzgerald, 
and J. J. Thomson. We have spent much fruitless labor in trying to 
discover where and how the expressions were obtained which are 
attributed to Glazebrook, but in which the notation has been altered. 
They ought to come from Glazebrook’s equations (24)-(27) (loc. cit.), 
but these appear identical with Lorentz’s equations (58)-(61) (Zeitschrift 
f. Math. u. Phys., vol. xxii, p. 27). They might be obtained by 
interchanging the expressions for vibrations in the plane of incidence 
and at right angles to it, with two changes of sign. 

The reader must be especially cautioned concerning the statements 
and implications of what has not been done in the electromagnetic 
theory. These are such as to suggest the question whether the author 
has taken the trouble to read the titles of the papers which have 
been published. We refer especially to what is said on pages 248, 
249 concerning absorption, dispersion, and the magnetic rotation of 
the plane of polarization. 

In the Experimental Part, with which the treatise closes, we have 
a comparison of formulae with the results of experiments by the 
author and others. The author has been particularly successful in 
the formula for dispersion. In the case of quartz (p. 545), the 
formula (with four constants) represents the results of experiment in 
a manner entirely satisfactory through the entire range of wave-length 
from 2.14 to 0.214. Those who may not agree with the 
author’s theoretical views will nevertheless be glad to see the results 
of experiment brought together, and, so far as may be, represented by 
formulae. 
ON THE VELOCITY OF LIGHT AS DETERMINED BY 
FOUCAULT’S REVOLVING MIRROR. 


[Nature, vol. XXXIII, p. 582, April 22, 1886.] 


It has been shown by Lord Rayleigh and others that the velocity 
(U) with which a group of waves is propagated in any medium may 
be calculated by the formula 

$$U = V - \lambda\,\frac{dV}{d\lambda},$$

where V is the wave-velocity, and λ the wave-length. It has also 
been observed by Lord Rayleigh that the fronts of the waves reflected 
by the revolving mirror in Foucault’s experiment are inclined one to 
another, and in consequence must rotate with an angular velocity 

$$\alpha\,\frac{dV}{d\lambda},$$

where α is the angle between two successive wave-planes of similar 
phase. When dV/dλ is positive (the usual case), the direction of 
rotation is such that the following wave-plane rotates towards the 
position of the preceding (see Nature, vol. xxv, p. 52). 

But I am not aware that attention has been called to the important 
fact, that while the individual wave rotates the wave-normal of the 
group remains unchanged, or, in other words, that if we fix our 
attention on a point moving with the group, therefore with the 
velocity U, the successive wave-planes, as they pass through that 
point, have all the same orientation. This follows immediately from 
the two formulae quoted above. For the interval of time between 
the arrival of two successive wave-planes of similar phase at the 
moving point is evidently λ/(V − U), which reduces by the first 
formula to dλ/dV. In this time the second of the wave-planes, having 
the angular velocity α dV/dλ, will rotate through an angle α towards 
the position of the first wave-plane. But α is the angle between the 
two planes. The second plane, therefore, in passing the moving point, 
will have exactly the same orientation which the first had. To get a 
picture of the phenomenon, we may imagine that we are able to see 
a few inches of the top of a moving carriage-wheel. The individual 
spokes rotate, while the group maintains a vertical direction. 
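
The cancellation in this argument can be traced with numbers. The following is a sketch only: the function names and the values of V, λ, dV/dλ and α are hypothetical, chosen merely to exercise the two formulae, and dV/dλ is assumed not to vanish.

```python
def group_velocity(v, wavelength, dv_dlambda):
    """Rayleigh's formula U = V - lambda * dV/dlambda."""
    return v - wavelength * dv_dlambda


def rotation_between_arrivals(v, wavelength, dv_dlambda, alpha):
    """Angle through which a wave-plane turns between the arrivals of two
    successive wave-planes at a point moving with the group."""
    u = group_velocity(v, wavelength, dv_dlambda)
    interval = wavelength / (v - u)        # time between successive wave-planes
    angular_velocity = alpha * dv_dlambda  # rotation rate of each wave-plane
    return angular_velocity * interval


# Hypothetical values in arbitrary units.
alpha = 1e-6
angle = rotation_between_arrivals(v=0.61, wavelength=0.59, dv_dlambda=0.08, alpha=alpha)
print(angle, alpha)
```

Whatever values are chosen, the angle turned equals α, so the second wave-plane reaches the moving point with the orientation which the first had there.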
This consideration greatly simplifies the theory of Foucault’s 
experiment, and makes it evident, I think, that the results of all 
such experiments depend upon the value of U, and not upon that of V. 

The discussion of the experiment by following a single wave, and 
taking account of its rotation, is a complicated process, and one in 
which it is very easy to leave out of account some of the elements of 
the problem. The principal objection to it, however, is its unreality. 
If the dispersion is considerable, no wave which leaves the revolving 
mirror will return to it. The individual disappears, only the group 
has permanence. Prof. Schuster in his communication of March 11 
(p. ^), has nevertheless obtained by this method, as the quantity 
“determined by the experiments hitherto performed,” $V^2/(2V - U)$, 
which, as he observes, is nearly equal to U. He would, I think, 
have obtained U precisely, if for the angle between two successive 
wave-planes of similar phase, instead of 2ωλ/V, he had used the more 
exact value 2ωλ/U. 

By the kindness of Prof. Michelson, I am informed with respect to 
his recent experiments on the velocity of light in bisulphide of carbon 
that he would be inclined to place the maximum brilliancy of the 
light between the spectral lines D and E, but nearer to D. If we 
take the mean between D and E, we have 

$$\frac{K}{U} = 1.745,$$

K denoting the velocity in vacuo (see p. 249 of this volume). The 
number observed was 1.76, “with an uncertainty of two units in the 
second place of decimals.” This agrees best with the first formula. 
The same would be true if we used values nearer to the line D. 
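
As a rough check, the quantity $V^2/(2V - U)$ may be compared with U for carbon disulphide, using $K(2V - U)/V^2 = 2(K/V) - (K/V)^2/(K/U)$. The sketch below assumes that the mean between D and E is taken by simply averaging the ratios K/V and K/U for the two lines; the function name is ours.

```python
def schuster_ratio(k_over_v, k_over_u):
    """K divided by V^2/(2V - U), expressed through the ratios K/V and K/U."""
    return 2.0 * k_over_v - k_over_v ** 2 / k_over_u


# Ratios for the lines D and E in carbon disulphide, averaged (an assumption).
k_over_v = (1.624 + 1.637) / 2.0
k_over_u = (1.722 + 1.767) / 2.0

observed, uncertainty = 1.76, 0.02
print("K/U             :", round(k_over_u, 4))
print("K(2V - U)/V^2   :", round(schuster_ratio(k_over_v, k_over_u), 4))
print("observed range  :", observed - uncertainty, "to", observed + uncertainty)
```

With these figures K/U falls inside the range of the observation, while the other quantity falls slightly below it, though the two differ by less than one per cent.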

J. Willard Gibbs. 

New Haven, Connecticut, April 1. [1886.] 
VELOCITY OF PROPAGATION OF ELECTROSTATIC FORCE. 

[Nature, vol. LIII, p. 609, April 2, 1896.] 

As we may have to wait some time for the experimental solution 
of Lord Kelvin's very instructive and suggestive problem concerning 
two pairs of spheres charged with electricity (see Nature of February 
6, p. 316), it may be interesting to see what the solution would be 
from the standpoint of existing electrical theories. 

In applying Maxwell’s theory to the problem it will be convenient 
to suppose the dimensions of both pairs of spheres very small in 
comparison with the unit of length, and the distance between the 
two pairs very great in comparison with the same unit. These 
conditions, which greatly simplify the equations which represent the 
phenomena, will hardly be regarded as affecting the essential nature 
of the question proposed. 

Let us first consider what would happen on the discharge of (A, B), 
if the system (c, d) were absent. 

Let $m_0$ be the initial value of the moment of the charge of the 
system (A, B), (this term being used in a sense analogous to that 
in which we speak of the moment of a magnet), and m the value 
of the moment at any instant. If we set 

$$m = F(t), \tag{1}$$

and suppose the discharge to commence when t = 0, and to be 
completed when t = h, we shall have 

$$F(t) = m_0 \quad \text{when } t < 0, \tag{2}$$

and 

$$F(t) = 0 \quad \text{when } t > h. \tag{3}$$

Let us set the origin of coordinates at the centre of the system 
(A, B), and the axis of x in the direction of the centre of the 
positively charged sphere. A unit vector in this direction we shall 
call i, and the vector from the origin to the point considered ρ. 
At any point outside of a sphere of unit radius about the origin, 
the electrical displacement (𝔇) is given by the vector equation 

$$4\pi\mathfrak{D} = \left[3r^{-5}F(t-cr) + 3cr^{-4}F'(t-cr) + c^{2}r^{-3}F''(t-cr)\right]x\rho - \left[r^{-3}F(t-cr) + cr^{-2}F'(t-cr) + c^{2}r^{-1}F''(t-cr)\right]i, \tag{4}$$

where F denotes the function determined by equation (1), F' and F'' 
its derivatives, and c the ratio of electrostatic and electromagnetic 
units of electricity, or the reciprocal of the velocity of light. For 
this satisfies the general equation 

$$\nabla^{2}\mathfrak{D} = c^{2}\,\frac{d^{2}\mathfrak{D}}{dt^{2}}, \tag{5}$$

as well as the so-called “equation of continuity,” and also satisfies 
the special conditions that when t < 0 

$$4\pi\mathfrak{D} = m_0\left(3r^{-5}x\rho - r^{-3}i\right)$$

outside of the unit sphere, and that at any time at the surface of 
this sphere 

$$4\pi\mathfrak{D} = m\left(3x\rho - i\right),$$

if we consider the terms containing the factor c as negligible, when 
not compensated by large values of r. That equation (4) satisfies 
the general conditions is easily verified, if we set 

$$u = r^{-1}F(t - cr), \tag{6}$$

and observe that 

$$\nabla^{2}u = c^{2}\,\frac{d^{2}u}{dt^{2}}, \tag{7}$$

and that the three components of 𝔇 are given by the equations 

$$\left.\begin{aligned} 4\pi f &= -\frac{d^{2}u}{dy^{2}} - \frac{d^{2}u}{dz^{2}},\\ 4\pi g &= \frac{d^{2}u}{dx\,dy},\\ 4\pi h &= \frac{d^{2}u}{dx\,dz}. \end{aligned}\right\} \tag{8}$$
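
That a function of the form F(t − cr)/r satisfies the wave equation can be checked numerically by finite differences. The sketch below is only an illustration under stated assumptions: the Gaussian profile F, the value of c, and the sample point are all hypothetical, and the Laplacian is taken in its spherically symmetric form $(1/r)\,d^2(ru)/dr^2$.

```python
import math

C = 0.5  # reciprocal of the wave velocity, arbitrary units (assumed value)


def F(s):
    """Hypothetical smooth profile standing in for the moment function."""
    return math.exp(-s * s)


def u(r, t):
    """The function u = F(t - c r) / r."""
    return F(t - C * r) / r


def laplacian_radial(r, t, h=1e-3):
    """Laplacian of a spherically symmetric function: (1/r) d^2(r u)/dr^2."""
    return ((r + h) * u(r + h, t) - 2.0 * r * u(r, t) + (r - h) * u(r - h, t)) / (h * h * r)


def second_time_derivative(r, t, h=1e-3):
    """Central-difference estimate of d^2 u / dt^2."""
    return (u(r, t + h) - 2.0 * u(r, t) + u(r, t - h)) / (h * h)


r, t = 2.0, 1.3
lhs = laplacian_radial(r, t)
rhs = C ** 2 * second_time_derivative(r, t)
print(lhs, rhs)
```

The two printed numbers agree to within the accuracy of the finite differences, which is what the wave equation for u requires.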


Equation (4) shows that the changes of the electrical displacement 
are represented by three systems of spherical waves, of forms 
determined by the rapidity of the discharge of the system (A, B), 
which expand with the velocity of light with amplitudes diminishing 
as $r^{-1}$, $r^{-2}$, and $r^{-3}$ respectively. Outside of these waves, the electrical 
displacement is unchanged, inside of them it is zero. 

If we write (with Maxwell) −d𝔄/dt for the force of electrodynamic 
induction at any point, and suppose its rectangular components 
calculated from those of −d²𝔇/dt² by the formula used in calculating 
the potential of a mass from its density, we shall have by Poisson’s 
theorem 

$$\nabla^{2}\left(\frac{d\mathfrak{A}}{dt}\right) = -4\pi\,\frac{d^{2}\mathfrak{D}}{dt^{2}},$$

or by (5), 

$$\nabla^{2}\left(\frac{d\mathfrak{A}}{dt}\right) = -4\pi c^{-2}\,\nabla^{2}\mathfrak{D},$$

whence 

$$\frac{d\mathfrak{A}}{dt} = -4\pi c^{-2}\,\mathfrak{D}. \tag{9}$$

From this, with (4), and the general equation 

$$4\pi c^{-2}\,\mathfrak{D} = -\frac{d\mathfrak{A}}{dt} - \nabla V,$$

we see that during the discharge of the system (A, B) the electrostatic 
force −∇V vanishes throughout all space, while its place is taken by 
a precisely equal electrodynamic force −d𝔄/dt. 

This electrodynamic force remains unchanged at every point until 
the passage of the waves, after which the electrostatic force, the 
electrodynamic force, and the displacement, have the permanent value 
zero. 
If we write Curl for the differentiating vector operator which 
Maxwell calls by that name, equations (8) may be put in the form 

$$4\pi\mathfrak{D} = \operatorname{Curl}\,\operatorname{Curl}\,(iu),$$

whence 

$$\frac{d\mathfrak{D}}{dt} = (4\pi)^{-1}\operatorname{Curl}\,\operatorname{Curl}\left(i\,\frac{du}{dt}\right).$$

From d𝔇/dt we may calculate the magnetic induction 𝔅 by an 
operation which is the inverse of $(4\pi)^{-1}$ Curl. We have therefore 

$$\mathfrak{B} = \operatorname{Curl}\left(i\,\frac{du}{dt}\right),$$

or 

$$\mathfrak{B} = \left[r^{-3}F'(t-cr) + cr^{-2}F''(t-cr)\right](yk - zj).$$

The magnetic induction is therefore zero except in the waves. 

Equations (4) and (9) give the value of d𝔄/dt as function of t and 
r. By integration, we may find the value of 𝔄, Maxwell’s “vector 
potential.” This will be of the form of the second member of (4) 
multiplied by $-c^{-2}$, if we should give each F one accent less, and for 
an unaccented F should write $F_1$, to denote the primitive of F which 
vanishes for the argument ∞. 

That which seems most worthy of notice is that although simultaneously 
with the discharge of the system (A, B) the values of what 
we call the electric potential, the electrodynamic force of induction, 
and the “vector potential,” are changed throughout all space, this does 
not appear connected with any physical change outside of the waves, 
which advance with the velocity of light. 

If we now suppose that there is a second pair of charged spheres 
(c, d), as in the original problem, the discharge of this pair will 
evidently occur when the relaxation of electrical displacement reaches 
it. The time between the discharges is, therefore, by Maxwell’s 
theory, the time required for light to pass from one pair to the other. 

It may also be interesting to observe that in the axis of x, on both 
sides of the origin, $x\rho = r^{2}i$, and equation (4) reduces to 

$$4\pi\mathfrak{D} = \left[2r^{-3}F(t-cr) + 2cr^{-2}F'(t-cr)\right]i.$$

Here, therefore, the oscillations are normal to the wave-surfaces. 
This might seem to imply that plane waves of normal oscillations 
may be propagated, since we are accustomed to regard a part of an 
infinite sphere as equivalent to a part of an infinite plane. Of course, 
such a result would be contrary to Maxwell’s theory. The paradox 
is explained if we consider that the parts of the wave-motion, 
expressed by F and F', diminish more rapidly than those expressed by 
F'', so that it is unsafe to take the displacements in the axis of x as 
approximately representing those at a moderate distance from it. In 
fact, if we consider the displacements not merely in the axis of x, but 
within a cylinder about that axis, and follow the waves to an infinite 
distance from the origin, we find no approximation to what is usually 
meant by plane waves with normal oscillations. 

J. Willard Gibbs. 

New Haven, Conn., March 12 [1896]. 
FOURIER’S SERIES. 

[Nature, vol. Lix, p. 200, Dec. 29, 1898.] 

I SHOULD like to add a few words concerning the subject of Prof. 
Michelson’s letter in Nature of October 6. In the only reply which 
I have seen (Nature, October 13), the point of view of Prof. Michelson 
is hardly considered. 

Let us write $f_n(x)$ for the sum of the first n terms of the series 

$$\sin x - \tfrac{1}{2}\sin 2x + \tfrac{1}{3}\sin 3x - \tfrac{1}{4}\sin 4x + \text{etc.}$$

I suppose that there is no question concerning the form of the curve 
defined by any equation of the form 

$$y = 2f_n(x).$$

Let us call such a curve $C_n$. As n increases without limit, the 
curve approaches a limiting form, which may be thus described. Let 
a point move from the origin in a straight line at an angle of 45° 
with the axis of X to the point (π, π), thence vertically in a straight 
line to the point (π, −π), thence obliquely in a straight line to the 
point (3π, π), etc. The broken line thus described (continued 
indefinitely forwards and backwards) is the limiting form of the 
curve as the number of terms increases indefinitely. That is, if any 
small distance d be first specified, a number n' may be then specified, 
such that for every value of n greater than n', the distance of any 
point in $C_n$ from the broken line, and of any point in the broken line 
from $C_n$, will be less than the specified distance d. 

But this limiting line is not the same as that expressed by the 
equation 

$$y = \lim_{n=\infty} 2f_n(x).$$

The vertical portions of the broken line described above are 
wanting in the locus expressed by this equation, except the points in 
which they intersect the axis of X. The process indicated in the 
last equation is virtually to consider the intersections of $C_n$ with fixed 
vertical transversals, and seek the limiting positions when n is 
increased without limit. It is not surprising that this process does 
not give the vertical portions of the limiting curve. If we should 
consider the intersections of $C_n$ with horizontal transversals, and 
seek the limits which they approach when n is increased indefinitely, 
we should obtain the vertical portions of the limiting curve as well as 
the oblique portions. 

It should be observed that if we take the equation 

$$y = 2f_n(x),$$

and proceed to the limit for n = ∞, we do not necessarily get y = 0 for 
x = π. We may get that ratio by first setting x = π, and then passing 
to the limit. We may also get y = 1, x = π, by first setting y = 1, and 
then passing to the limit. Now the limit represented by the equation 
of the broken line described above is not a special or partial limit 
relating solely to some special method of passing to the limit, but it 
is the complete limit embracing all sets of values of x and y which 
can be obtained by any process of passing to the limit. 

J. Willard Gibbs. 

New Haven, Conn., November 29 [1898]. 


[Nature, vol. Lix, p. 606, April 27, 1899.] 

I should like to correct a careless error which I made (Nature, 
December 29, 1898) in describing the limiting form of the family of 
curves represented by the equation 

$$y = 2\left(\sin x - \tfrac{1}{2}\sin 2x + \dots \pm \tfrac{1}{n}\sin nx\right) \tag{1}$$

as a zigzag line consisting of alternate inclined and vertical portions. 
The inclined portions were correctly given, but the vertical portions, 
which are bisected by the axis of X, extend beyond the points where 
they meet the inclined portions, their total lengths being expressed 
by four times the definite integral 

$$\int_0^{\pi} \frac{\sin u}{u}\,du.$$

If we call this combination of inclined and vertical lines C, and 
the graph of equation (1) $C_n$, and if any finite distance d be specified, 
and we take for n any number greater than $100/d^2$, the distance of 
every point in Cn from C is less than d, and the distance of every 
point in C from Cn is also less than d. We may therefore call C the 
limit (or limiting form) of the sequence of curves of which Cn is the 
general designation. 

But this limiting form of the graphs of the functions expressed 
by the sum (1) is different from the graph of the function expressed by 
the limit of that sum. In the latter the vertical portions are wanting, 
except their middle points. 
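
The extension of the vertical portions beyond ±π can be seen by direct summation. The sketch below is an illustration under our own choices: the function names and the value n = 1000 are hypothetical, the crest nearest x = π is taken at x = π − π/(n + 1), where the derivative of the partial sum vanishes, and the integral of sin u / u from 0 to π is evaluated from its power series.

```python
import math


def partial_sum(x, n):
    """y = 2 (sin x - sin 2x / 2 + ... +/- sin nx / n)."""
    return 2.0 * sum((-1) ** (k + 1) * math.sin(k * x) / k for k in range(1, n + 1))


def sine_integral_pi(terms=40):
    """Integral of sin(u)/u from 0 to pi, summed from its Maclaurin series."""
    return sum(
        (-1) ** k * math.pi ** (2 * k + 1) / ((2 * k + 1) * math.factorial(2 * k + 1))
        for k in range(terms)
    )


n = 1000
peak = partial_sum(math.pi - math.pi / (n + 1), n)  # crest nearest to x = pi
half_length = 2.0 * sine_integral_pi()              # half of a vertical portion of C
print(peak, half_length, math.pi)
```

The crest lies close to twice the integral, about 3.70, and not to π, so that each vertical portion of the limiting form reaches about 0.56 beyond the ends of the inclined portions.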
I think this distinction important, for (with exception of what 
relates to my unfortunate blunder described above) whatever differences 
of opinion have been expressed on this subject seem due, for the 
most part, to the fact that some writers have had in mind the limit 
of the graphs, and others the graph of the limit of the sum. A misunderstanding 
on this point is a natural consequence of the usage 
which allows us to omit the word limit in certain connections, as 
when we speak of the sum of an infinite series. In terms thus 
abbreviated, either of the things which I have sought to distinguish 
may be called the graph of the sum of the infinite series. 

J. Willard Gibbs. 

New Haven, April 12 [1899]. 



XX. 


RUDOLF JULIUS EMANUEL CLAUSIUS. 

[Proceedings of the American Academy, new series, vol. xvi, pp. 458-465, 1889.] 

Rudolf Julius Emanuel Clausius was born at Cöslin in Pomerania, 
January 2, 1822. His studies, after 1840, were pursued at Berlin, 
where he became Privat-docent in the University, and Instructor in 
Physics in the School of Artillery. He was Professor of Physics 
at Zurich in the Polytechnicum (1855-67) and in the University 
(1857-67), at Würzburg (1867-69), and finally at Bonn (1869-88), 
where he died on the 24th of August, 1888. 

His literary activity commenced in 1847, with the publication of 
a memoir in Crelle’s Journal, “Ueber die Lichtzerstreuung in der 
Atmosphäre, und über die Intensität des durch die Atmosphäre 
reflectirten Sonnenlichts.” * This was immediately followed by other 
writings relating to the same subject, two of which were subsequently 
translated from Poggendorff’s Annalen † for Taylor’s Scientific 
Memoirs. A treatise entitled “Die Lichterscheinungen der Atmosphäre” 
formed part of Grunert’s “Beiträge zur meteorologischen 
Optik.” 

An entirely different subject, the elasticity of solids, was discussed 
in his paper (1849), “Ueber die Veränderungen, welche in den bisher 
gebräuchlichen Formeln für das Gleichgewicht und die Bewegung fester 
Körper durch neuere Beobachtungen nothwendig geworden sind.” ‡ 

But it was with questions of quite another order of magnitude that 
his name was destined to be associated. The fundamental questions 
concerning the relation of heat to mechanical effect, which had been 
raised by Rumford, Carnot, and others, to meet with little response, 
were now everywhere pressing to the front. 

“For more than twelve years,” said Regnault in 1853, “I have 
been engaged in collecting the materials for the solution of this 
question: Given a certain quantity of heat, what is, theoretically, the 
amount of mechanical effect which can be obtained by applying the 
heat to evaporation, or the expansion of elastic fluids, in the various 
circumstances which can be realised in practice?” § The twenty-first 
volume of the Memoirs of the Academy of Paris, describing the first 
part of the magnificent series of researches which the liberality of the 
French government enabled him to carry out for the solution of this 
question, was published in 1847. In the same year appeared Helmholtz’s 
celebrated memoir, “Ueber die Erhaltung der Kraft.” For 
some years Joule had been making those experiments which were 
to associate his name with one of the fundamental laws of thermodynamics 
and one of the principal constants of nature. In 1849 he 
made that determination of the mechanical equivalent of heat by the 
stirring of water which for nearly thirty years remained the unquestioned 
standard. In 1848 and 1849 Sir William Thomson was engaged 
in developing the consequences of Carnot’s theory of the motive 
power of heat, while Professor James Thomson in demonstrating the 
effect of pressure on the freezing point of water by a Carnot’s cycle, 
showed the flexibility and the fruitfulness of a mode of demonstration 
which was to become canonical in thermodynamics. Meantime 
Rankine was attacking the problem in his own way, with one of 
those marvellous creations of the imagination of which it is so 
difficult to estimate the precise value. 

* Vol. xxxiv, p. 122, and vol. xxxvi, p. 185.  † Vol. lxxvi, pp. 161 and 188. 
‡ Pogg. Ann., vol. lxxvi, p. 46 (1849).  § Comptes Rendus, vol. xxxvi, p. 676. 

Such was the state of the question when Clausius published his 
first memoir on thermodynamics: “Ueber die bewegende Kraft der 
Wärme, und die Gesetze, welche sich daraus für die Wärmelehre selbst 
ableiten lassen.” * 

This memoir marks an epoch in the history of physics. If we say, 
in the words used by Maxwell some years ago, that thermodynamics 
is “a science with secure foundations, clear definitions, and distinct 
boundaries,” † and ask when those foundations were laid, those definitions 
fixed, and those boundaries traced, there can be but one answer. 
Certainly not before the publication of that memoir. The materials 
indeed existed for such a science, as Clausius showed by constructing 
it from such materials, substantially, as had for years been the common 
property of physicists. But truth and error were in a confusing 
state of mixture. Neither in France, nor in Germany, nor in Great 
Britain, can we find the answer to the question quoted from Regnault. 
The case was worse than this, for wrong answers were confidently 
urged by the highest authorities. That question was completely 
answered, on its theoretical side, in the memoir of Clausius, and the 
science of thermodynamics came into existence. And as Maxwell said 
in 1878, so it might have been said at any time since the publication 
of that memoir, that the foundations of the science were secure, its 
definitions clear, and its boundaries distinct. 

* Read in the Berlin Academy, February 18, 1850, and published in the March and 
April numbers of Poggendorff’s Annalen. 

† Nature, vol. xvii, p. 257. 
The constructive power thus exhibited, this ability to bring order 
out of confusion, this breadth of view which could apprehend one 
truth without losing sight of another, this nice discrimination to 
separate truth from error,—these are qualities which place the 
possessor in the first rank of scientific men. 

In the development of the various consequences of the fundamental 
propositions of thermodynamics, as applied to all kinds of 
physical phenomena, Clausius was rivalled, perhaps surpassed, in 
activity and versatility by Sir William Thomson. His attention, 
indeed, seems to have been less directed toward the development of 
the subject in extension, than toward the nature of the molecular 
phenomena of which the laws of thermodynamics are the sensible 
expression. He seems to have very early felt the conviction, that 
behind the second law of thermodynamics, which relates to the heat 
absorbed or given out by a body, and therefore capable of direct 
measurement, there was another law of similar form but relating to 
the quantities of heat (i.e., molecular vis viva) absorbed in the 
performance of work, external or internal. 

This may be made more definite, if we express the second law in 
a mathematical form, as may be done by saying that in any reversible 
cyclic process which a body may undergo 



where dQ is an elementary portion of the heat imparted to the body, 
and t the absolute temperature of the body, or the portion of it which 
receives the heat. Or, without limitation to cyclic processes, we 
may say that for any reversible infinitesimal change, 

$$dQ = t\,dS,$$

where S denotes a certain function of the state of the body, called by 
Clausius the entropy. The element of heat may evidently be divided 
into two parts, of which one represents the increase of molecular 
vis viva in the body, and the other the work done against forces,
either external or internal. If we call these parts $dH$ and $dQ_w$, we
have

$$dQ = dH + dQ_w.$$

Now the proposition of which Clausius felt so strong a conviction 
was that for reversible cyclic processes

$$\int \frac{dQ_w}{t} = 0,$$

and that for any reversible infinitesimal change

$$dQ_w = t\,dZ,$$

where Z is another function of the state of the body, which he 
called the disgregation, and regarded as determined by the positions
of the elementary parts of the body without reference to their velocities.
In this respect it differed from the entropy. An immediate
consequence of these relations is that for any reversible cyclic process

$$\int \frac{dH}{t} = 0,$$

and therefore that H, the molecular vis viva of the body, must be
a function of the temperature alone. This important result was
expressed by Clausius in the following words: “Die Menge der in
einem Körper wirklich vorhandenen Wärme ist nur von seiner
Temperatur und nicht von der Anordnung seiner Bestandtheile
abhängig.” [The quantity of heat actually present in a body depends
only on its temperature, and not on the arrangement of its
constituent parts.]
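
The step from the vanishing cyclic integral of dH/t to the dependence of H on the temperature alone may be sketched as follows. The sketch is an illustration, not Clausius's own argument. It assumes that the state of the body is fixed by the temperature t together with one other variable, here given the hypothetical label x, and that H is a function of the state. The cyclic integral itself follows by subtracting the proposition for the work part of the heat from the second law applied to the whole heat.

```latex
% Assumption: the state is described by (t, x); x is an illustrative second variable.
\oint \frac{dH}{t} = 0 \ \text{for every reversible cycle}
\;\Longrightarrow\;
\frac{dH}{t}
  = \frac{1}{t}\frac{\partial H}{\partial t}\,dt
  + \frac{1}{t}\frac{\partial H}{\partial x}\,dx
\ \text{is an exact differential,}
\\[4pt]
\frac{\partial}{\partial x}\!\left(\frac{1}{t}\frac{\partial H}{\partial t}\right)
  = \frac{\partial}{\partial t}\!\left(\frac{1}{t}\frac{\partial H}{\partial x}\right)
\;\Longrightarrow\;
\frac{1}{t}\frac{\partial^{2} H}{\partial x\,\partial t}
  = \frac{1}{t}\frac{\partial^{2} H}{\partial t\,\partial x}
  - \frac{1}{t^{2}}\frac{\partial H}{\partial x}
\;\Longrightarrow\;
\frac{\partial H}{\partial x} = 0 .
```

Since the mixed second derivatives of H agree, only the last term survives, and H cannot depend on x.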

To return to the equation

$$dQ_w = t\,dZ.$$

This expresses that heat tends to increase the disgregation, and that
the intensity of this tendency is proportional to the absolute temperature.
In the words of Clausius: “Die mechanische Arbeit, welche
die Wärme bei irgend einer Anordnungsänderung eines Körpers thun
kann, ist proportional der absoluten Temperatur, bei welcher die
Aenderung geschieht.” [The mechanical work which heat can do in any
change of arrangement of a body is proportional to the absolute
temperature at which the change takes place.]

Such in brief and in part were the views advanced by Clausius in
1862, in his memoir, “Ueber die Anwendung des Satzes von der 
Aequivalenz der Verwandlungen auf die innere Arbeit.”* Although 
they were advanced rather as a hypothesis than as anything for 
which he could give a formal proof, he seems to have little doubt of 
their correctness, and his confidence seems to have increased with the 
course of time. 

The substantial correctness of these views cannot now be called in 
question. The researches especially of Maxwell and Boltzmann have 
shown that the molecular vis viva is proportional to the absolute
temperature, and Boltzmann has even been able to determine the
precise nature of the functions which Clausius called entropy and
disgregation.† But the anticipation, to a certain extent, at so early a
period in the history of the subject, of the ultimate form which the 
theory was to take, shows a remarkable insight, which is by no 
means to be lightly esteemed on account of the acknowledged want of 
a rigorous demonstration. The propositions, indeed, as relating to 
quantities which escape direct measurement, belong to molecular 
science, and seem to require for their complete and satisfactory 
demonstration a considerable development of that science. This 

* Pogg. Ann., vol. cxvi, p. 73. See also vol. cxxvii, p. 477 (1866).
† Sitzungsberichte Wien. Akad., vol. lxiii, p. 728 (1871).

development naturally commenced with the simplest case involving
the characteristic problems of the subject,—the case, namely, of gases. 

The origin of the kinetic theory of gases is lost in remote antiquity, 
and its completion the most sanguine cannot hope to see. But a 
single generation has seen it advance from the stage of vague surmises
to an extensive and well established body of doctrine. This is
mainly the work of three men, Clausius, Maxwell, and Boltzmann, of 
whom Clausius was the earliest in the field, and has been called by 
Maxwell the principal founder of the science.* We may regard his 
paper (1857), “Ueber die Art der Bewegung, welche wir Wärme
nennen,”† as marking his definite entrance into this field, although
many points were incidentally discussed in earlier papers.

This was soon followed by his papers, “Ueber die mittlere Länge
der Wege, welche bei der Molecularbewegung gasförmiger Körper
von den einzelnen Molecülen zurückgelegt werden,”‡ and “Ueber die
Wärmeleitung gasförmiger Körper.”§

A very valuable contribution to molecular science is the conception 
of the virial, defined in his paper (1870), “Ueber einen auf die Wärme
anwendbaren mechanischen Satz,”‖ where he shows that in any case of stationary
motion the mean vis viva of the system is equal to its virial. 
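
As a hedged numerical illustration of this theorem (not Clausius's own derivation), the sketch below takes a one-dimensional harmonic oscillator as an assumed example of stationary motion. It compares the time average of the vis viva, taken here as ½mv², with the virial, taken as the time average of −½xF. The function name and the parameter values are illustrative assumptions.

```python
import math


def mean_vis_viva_and_virial(m=1.0, k=4.0, amplitude=0.5, samples=10_000):
    """Average the vis viva and the virial over one period of a 1-D
    harmonic oscillator x(t) = A cos(w t) under the force F = -k x."""
    w = math.sqrt(k / m)              # angular frequency of the motion
    period = 2.0 * math.pi / w
    vis_viva_sum = 0.0
    virial_sum = 0.0
    for i in range(samples):
        t = period * i / samples      # equally spaced instants in one period
        x = amplitude * math.cos(w * t)
        v = -amplitude * w * math.sin(w * t)
        force = -k * x
        vis_viva_sum += 0.5 * m * v * v    # (1/2) m v^2
        virial_sum += -0.5 * x * force     # -(1/2) x F
    return vis_viva_sum / samples, virial_sum / samples


if __name__ == "__main__":
    vis_viva, virial = mean_vis_viva_and_virial()
    print("mean vis viva:", vis_viva)
    print("virial:       ", virial)
```

The two averages agree to within rounding, as the theorem requires for a motion that repeats itself.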

In the mean time, Maxwell and Boltzmann had entered the field.
Maxwell’s first paper, “On the Motions and Collisions of perfectly
elastic Spheres,”¶ was characterized by a new manner of proposing
the problems of molecular science. Clausius was concerned with the 
mean values of various quantities which vary enormously in the 
smallest time or space which we can appreciate. Maxwell occupied 
himself with the relative frequency of the various values which these 
quantities have. In this he was followed by Boltzmann. In reading 
Clausius, we seem to be reading mechanics; in reading Maxwell, and 
in much of Boltzmann’s most valuable work, we seem rather to be 
reading in the theory of probabilities. There is no doubt that the 
larger manner in which Maxwell and Boltzmann proposed the 
problems of molecular science enabled them in some cases to get 
a more satisfactory and complete answer, even for those questions 
which do not at first sight seem to require so broad a treatment. 

Boltzmann’s first work, however (1866), “Ueber die mechanische
Bedeutung des zweiten Hauptsatzes der Wärmetheorie,”** was in a
line in which no one had preceded him, although he was followed by 

* Nature, vol. xvii, p. 278.

† Pogg. Ann., vol. c, p. 353 (1857).

‡ Ibid., vol. cv, p. 239 (1858). See also Wied. Ann., vol. x, p. 92.

§ Ibid., vol. cxv, p. 1 (1862).

‖ Ibid., vol. cxli, p. 124. See also Jubelband, p. 411.

¶ Phil. Mag., vol. xix, p. 19 (1860).

** Sitzungsberichte Wien. Akad., vol. liii, p. 195.

some of the most distinguished names among his contemporaries.
Somewhat later (1870) Clausius, whose attention had not been called 
to Boltzmann’s work, wrote his paper, “Ueber die Zurückführung des
zweiten Hauptsatzes der mechanischen Wärmetheorie auf allgemeine
mechanische Principien.”*

The point of departure of these investigations, and others to which 
they gave rise, is the consideration of the mean values of the force-function
and of the vis viva of a system in which the motions are
periodic, and of the variations of these mean values when the external
influences are changed. The theorems developed belong to the same
general category as the principle of least action, and the principle
or principles known as Hamilton’s, which have to do, explicitly or
implicitly, with the variations of these mean values. 

Among other papers of Clausius on this subject, we may mention 
the two following: “Ueber einen neuen mechanischen Satz in
Bezug auf stationäre Bewegung”† (1873), and “Ueber den Satz vom
mittleren Ergal und seine Anwendung auf die Molecularbewegungen
der Gase”‡ (1874).

The first problem of molecular science is to derive from the observed 
properties of bodies as accurate a notion as possible of their molecular 
constitution. The knowledge we may gain of their molecular constitution
may then be utilized in the search for formulas to represent
their observable properties. A most notable achievement in this 
direction is that of van der Waals, in his celebrated memoir, “On the
Continuity of the Gaseous and Liquid States.” To this part of the
subject belong the following papers of Clausius: “Ueber das Verhalten
der Kohlensäure in Bezug auf Druck, Volumen und Temperatur,”§
and “Ueber die theoretische Bestimmung des Dampfdruckes und der
Volumina des Dampfes und der Flüssigkeit” (two papers).‖

Another matter in which Clausius showed his originality and 
power was the vexed subject of electrodynamics, as treated in his
memoir, “Ueber die Ableitung eines neuen electrodynamischen
Grundgesetzes.”¶ Various points in the theory of electricity in
which the principles of thermodynamics or of molecular science 
were involved, had previously been treated in different papers, of 
which the earliest appeared in 1852,** while the doctrine of the 

* Pogg. Ann., vol. cxlii, p. 433.

† Ibid., vol. cl, p. 106.

‡ Ibid., Ergänzungsband vii, p. 216.

§ Wied. Ann., vol. ix, p. 337 (1880).

‖ Ibid., vol. xiv, p. 279 and p. 692 (1881).

¶ Crelle's Journal, vol. lxxxii, p. 85 (1877).

** “Ueber das mechanische Aequivalent einer electrischen Entladung und die dabei
stattfindende Erwärmung des Leitungsdrahtes.” Pogg. Ann., vol. lxxxvi, p. 337.
“Ueber die bei einem stationären electrischen Strome in dem Leiter gethane Arbeit

potential (electrical and gravitational) was treated in a separate book,
which appeared in 1859, with the title, “Die Potentialfunction und 
das Potential, ein Beitrag zur mathematischen Physik.” This
subsequently went through several editions, in which it was revised
and enlarged. All these subjects, with others, were brought together
in a single volume, “Die mechanische Behandlung der Electricität,”
which appeared in 1879, forming the second volume of his
“Mechanische Wärmetheorie.”* Later papers on electricity related
to the principles of electrodynamics,† electrical and magnetic units,‡
and dynamo-electric machines.§

The Royal Society’s catalogue of scientific papers, and the excellent 
indices to the Annalen der Physik und Chemie, in which Clausius’s
work usually appeared, render it unnecessary to enumerate in detail 
his scientific papers. The list, indeed, would be a long one. The 
Royal Society’s catalogue gives seventy-seven titles for the years 
1847-1873. Subsequently twenty-five papers have appeared in the 
Annalen alone, and about half as many others elsewhere. 

But such work as that of Clausius is not measured by counting 
titles or pages. His true monument lies not on the shelves of 
libraries, but in the thoughts of men, and in the history of more 
than one science. 


und erzeugte Wärme.” Pogg. Ann., vol. lxxxvii, p. 415 (1852). “Ueber die Anwendung
der mechanischen Wärmetheorie auf die thermoelectrischen Erscheinungen.”
Pogg. Ann., vol. xc, p. 513 (1853). “Ueber die Electricitätsleitung in Electrolyten.”
Pogg. Ann., vol. ci, p. 338 (1857).

* The first volume of this work appeared in 1876, and contained the general theory 
with the more immediate consequences of the two fundamental laws. The third volume 
has not yet appeared, but it is expected very soon, edited by Professor Planck and 
Dr. Pulfrich. In a certain sense this work may be regarded as a second edition of an 
earlier one (1864 and 1867), which consisted of a reprint of papers and had the title 
“Abhandlungen über die mechanische Wärmetheorie.”
† Wied. Ann., vol. x, p. 608; vol. xi, p. 604.

‡ Ibid., vol. xvi, p. 629; vol. xvii, p. 713.

§ Ibid., vol. xx, p. 363; vol. xxi, p. 385.

HUBERT ANSON NEWTON.

[American Journal of Science, ser. 4, vol. iii, pp. 359-376, May 1897.]
(Read before the National Academy of Sciences, in April, 1897.) 

Hubert Anson Newton was born on March 19th, 1830, at 
Sherburne, N.Y., and died at New Haven, Conn., on the 12th day of 
August, 1896. He was the fifth son of a family of seven sons and 
four daughters, children of William and Lois (Butler) Newton. The 
parents traced their ancestry back to the first settlers of Massachusetts
and Connecticut,* and had migrated from the latter to
Sherburne, when many parts of central New York were still a
wilderness. They both belonged to families remarkable for longevity,
and lived themselves to the ages of ninety-three and ninety-four
years. Of the children, all the sons and two daughters were living as
recently as the year 1889, the youngest being then fifty-three years
of age. William Newton was a man of considerable enterprise, and 
undertook the construction of the Buffalo section of the Erie canal, 
as well as other work in canal and railroad construction in New 
York and Pennsylvania. In these constructions he is said to have
relied on his native abilities to think out for himself the solution
of problems which are generally a matter of technical training. His
wife was remarkable for great strength of character united with a
quiet temperament and well-balanced mind, and was noted among 
her neighbors for her mathematical powers. 

Young Newton, whose mental endowments were thus evidently 
inherited, and whose controlling tastes were manifested at a very 
early age, fitted for college at the schools of Sherburne, and at the 
age of sixteen entered Yale College in the class graduating in 1850. 
After graduation he pursued his mathematical studies at New Haven 
and at home, and became tutor at Yale in January, 1853, when on 
account of the sickness and death of Professor Stanley the whole 
charge of the mathematical department devolved on him from the first. 

* Richard Butler, the great-grandfather of Lois Butler, came over from England 
before 1633, and was one of those who removed from Cambridge to Hartford. An 
ancestor of William Newton came directly from England to the New Haven colony
about the middle of the same century.

In 1855, he was appointed professor of mathematics at the early
age of twenty-five. This appointment testifies to the confidence 
which was felt in his abilities, and is almost the only instance in 
which the Yale Corporation has conferred the dignity of a full 
professorship on so young a man. 

This appointment being accompanied with a leave of absence for a 
year, in order to give him the opportunity to study in Europe, it was 
but natural that he should be attracted to Paris, where Chasles was
expounding at the Sorbonne that modern higher geometry of which
he was to so large an extent the creator, and which appeals so 
strongly to the sense of the beautiful. And it was inevitable that 
the student should be profoundly impressed by the genius of his 
teacher and by the fruitfulness and elegance of the methods which he 
was introducing. The effect of this year’s study under the inspiring 
influence of such a master is seen in several contributions to the 
Mathematical Monthly during its brief existence in the years 1858-61.
One of these was a problem which attracted at once the attention
of Cayley, who sent a solution. Another was a discussion of the
problem “to draw a circle tangent to three given circles,” remarkable
for his use of the principle of inversion. A third was a very
elaborate memoir on the construction of curves by the straight edge
and compasses, and by the straight edge alone. These early essays in
geometry show a mind thoroughly imbued with the spirit of modern
geometry, skilful in the use of its methods, and eager to extend the
bounds of our knowledge. 

Nevertheless, although for many years the higher geometry was 
with him a favorite subject of instruction for his more advanced 
students, either his own preferences, or perhaps rather the influence 
of his environment, was destined to lead him into a very different 
field of research. In the attention which has been paid to astronomy
in this country we may recognize the history of the world repeating 
itself in a new country in respect to the order of the development of 
the sciences, or it may be enough to say that the questions which 
nature forces on us are likely to get more attention in a new country 
and a bustling age, than those which a reflective mind puts to itself, 
and that the love of abstract truth which prompts to the construction 
of a system of doctrine, and the refined taste which is a critic of 
methods of demonstration, are matters of slow growth. At all events, 
when Professor Newton was entering upon his professorship, the 
study of the higher geometry was less consonant with the spirit of 
the age in this country than the pursuit of astronomical knowledge, 
and the latter sphere of activity soon engrossed his best efforts. 

Yet it was not in any of the beaten paths of astronomers that 
Professor Newton was to move. It was rather in the wilds of a 
terra incognita, which astronomers had hardly troubled themselves to
claim as belonging to their domain, that he first labored to establish 
law and order. It was doubtless not by chance that he turned his 
attention to the subject of shooting stars. The interest awakened in 
this country by the stupendous spectacle of 1833, which was not seen 
in Europe, had not died out. This was especially true at New Haven, 
where Mr. Edward C. Herrick was distinguished for his indefatigable 
industry both in personal observation and in the search for records of 
former showers. A rich accumulation of material was thus awaiting 
development. In 1861, the Connecticut Academy of Arts and Sciences 
appointed a committee “ to communicate with observers in various 
localities for combined and systematic observations upon the August 
and November meteors.” In this committee Professor Newton was
preeminently active. He entered zealously upon the work of collecting
material by personal observation and correspondence and by
organizing corps of observers of students and others, and at the same 
time set himself to utilize the material thus obtained by the most 
careful study. The value of the observations collected was greatly 
increased by a map of the heavens for plotting meteor-paths, which 
was prepared by Professor Newton and printed at the expense of the 
Connecticut Academy for distribution among observers. 

By these organized efforts, in a great number of cases observations
were obtained on the same meteor as seen from different places, and
the actual path in the atmosphere was computed by Professor Newton.
In a paper published in 1865* the vertical height of the beginning
and the end of the visible part of the path is given for more than 
one hundred meteors observed on the nights of August 10th and 
November 13th, 1863. It was shown that the average height of the 
November meteors is fifteen or twenty miles higher than that of 
the August meteors, the former beginning in the mean at a height 
of ninety-six miles and ending at sixty-one, the latter beginning at 
seventy and ending at fifty-six. 

We mention this paper first, because it seems to represent the 
culmination of a line of activity into which Professor Newton had 
entered much earlier. We must go back to consider other papers
which he had published in the meantime. 

His first papers on this subject, 1860-62,† were principally devoted
to the determination of the paths and velocities of certain brilliant 
meteors or fireballs, which had attracted the attention of observers in 
different localities. Three of these appeared to have velocities much 
greater than is possible for permanent members of the solar system. 
To another a particular interest attached as belonging to the August 

* Amer. Jour. Sci., ser. 2, vol. xl, p. 250.

† Amer. Jour. Sci., ser. 2, vol. xxx, p. 186; xxxii, p. 448; and xxxiii, p. 338.

shower, although exceptional in size. For this he calculated the
elements of the orbit which would give the observed path and velocity.
But the determination of the velocity in such cases, which depends
upon the estimation by the observers of the time of flight, is necessarily
very uncertain, and at best affords only a lower limit for the
value of the original velocity of the body before it encountered the 
resistance of the earth's atmosphere. This would seem to constitute an 
insuperable difficulty in the determination of the orbits of meteoroids, 
to use the term which Professor Newton applied to these bodies, 
before they enter the earth's atmosphere to appear for a moment as 
luminous meteors. Yet it has been completely overcome in the case 
of the November meteors, or Leonids as they are called from the
constellation from which they appear to radiate. This achievement 
constitutes one of the most interesting chapters in the history of 
meteoric science, and gives the subject an honorable place among the 
exact sciences. 

In the first place, by a careful study of the records, Professor 
Newton showed that the connection of early showers with those of 
1799 and 1833 had been masked by a progressive change in the time 
of the year in which the shower occurs. This change had amounted 
to a full month between A.D. 902, when the shower occurred on
October 13, and 1833, when it occurred on November 13. It is in 
part due to the precession of the equinoxes, and in part to the motion 
of the node where the earth’s orbit meets that of the meteoroids. 
This motion must be attributed to the perturbations of the orbits of 
the meteoroids which are produced by the attractions of the planets, 
and being in the direction opposite to that of the equinoxes, Professor 
Newton inferred that the motion of the meteoroids must be retrograde. 

The showers do not, however, occur whenever the earth passes the
node, but only when the passage occurs within a year or two before
or after the termination of a cycle of 33.25 years. This number is
obtained by dividing the interval between the showers of 902 and
1833 by 28, the number of cycles between these dates, and must
therefore be a very close approximation. For if these showers did
not mark the precise end of cycles the resultant error would be
divided by 28. Professor Newton showed that this value of the cycle
requires that the number of revolutions performed by the meteoroids
in one year should be either $2 \pm \frac{1}{33.25}$, or $1 \pm \frac{1}{33.25}$, or $\frac{1}{33.25}$. In other words,
the periodic time of the meteoroids must be either 180.0 or 185.4 or
354.6 or 376.6 days, or 33.25 years. Now the velocity of any body
in the solar system has a simple relation to its periodic time and its 
distance from the sun. Assuming, therefore, any one of these five 
values of the periodic time, we have the velocities of the Leonids 
at the node very sharply determined. From this velocity, with the
position of the apparent radiant, which gives the direction of the
relative motion, and with the knowledge that the heliocentric motion 
is retrograde, we may easily determine the orbit. 
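
The arithmetic behind the cycle and the five admissible periodic times can be checked with a short sketch. It assumes a year of 365.25 days, a value not stated in the text, and takes the admissible numbers of revolutions per year to be 2 ± 1/c, 1 ± 1/c and 1/c, where c is the length of the cycle in years. The function names are illustrative.

```python
DAYS_PER_YEAR = 365.25  # assumed length of the year in days


def cycle_length_years(first_shower=902, last_shower=1833, cycles=28):
    """Mean interval between showers, from the two dated showers."""
    return (last_shower - first_shower) / cycles


def candidate_periods_days(cycle_years):
    """Periodic times in days for the five admissible numbers of
    revolutions per year: 2 +/- 1/c, 1 +/- 1/c and 1/c."""
    f = 1.0 / cycle_years
    revolutions_per_year = [2 + f, 2 - f, 1 + f, 1 - f, f]
    return [DAYS_PER_YEAR / n for n in revolutions_per_year]


if __name__ == "__main__":
    cycle = cycle_length_years()
    print(f"cycle = {cycle} years")
    for days in candidate_periods_days(cycle):
        print(f"{days:10.1f} days = {days / DAYS_PER_YEAR:7.3f} years")
```

With the assumed year the first four periods agree with the quoted figures to within about a tenth of a day, and the fifth is the cycle of 33.25 years itself.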

We have, therefore, five orbits from which to choose. The 
calculation of the secular motion of the node due to the disturbing
action of the planets, would enable us to decide between these orbits. 

Such are the most important conclusions which Professor Newton 
derived from the study of these remarkable showers, interesting not 
only from the magnificence of the spectacle occasionally exhibited, 
but in a much higher degree from the peculiarity in the periodic
character of their occurrence, which affords the means of the determination
of the orbit of the meteoroids with a precision which would
at first sight appear impossible. 

Professor Newton anticipated a notable return of the shower in 
1866, with some precursors in the years immediately preceding, a 
prediction which was amply verified. In the meantime he turned
his attention to the properties which belong to shooting stars in
general, and especially to those average values which relate to large
numbers of these bodies not belonging to any particular swarm.

This kind of investigation Maxwell has called statistical, and has
in more than one passage signalized its difficulties. The writer
recollects a passage of Maxwell which was pointed out to him by
Professor Newton, in which the author says that serious errors have
been made in such inquiries by men whose competency in other
branches of mathematics was unquestioned. Doubtless Professor
Newton was very conscious of the necessity of caution in these
inquiries, as is indeed abundantly evident from the manner in which
he expressed his conclusions; but the writer is not aware of any
passage in which he has afforded an illustration of Maxwell's remark.

The results of these investigations appeared in an elaborate memoir 
“ On Shooting Stars,” which was read to the National Academy in 
1864, and appeared two years later in the Memoirs of the Academy.*
An abstract was given in the American Journal of Science in 1865.†
The following are some of the subjects treated, with some of the more
interesting results:

The distribution of the apparent paths of shooting stars in azimuth 
and altitude. 

The vertical distribution of the luminous part of the real paths. 
The value found for the mean height of the middle point of the 
luminous path was a trifle less than sixty miles. 

The mean length of apparent paths. 

The mean distance of paths from the observer. 


* Vol. i, 3d memoir.

The mean foreshortening of paths.

The mean length of the visible part of the real paths. 

The mean time of flight as estimated by observers. 

The distribution of the orbits of meteoroids in the solar system. 

The daily number of shooting stars, and the density of the
meteoroids in the space which the earth traverses.

The average number of shooting stars which enter the atmosphere
daily, and which are large enough to be visible to the naked eye,
if the sun, moon and clouds would permit it, is more than seven 
and a half millions. Certain observations with instruments seem 
to indicate that this number should be increased to more than 
four hundred millions, to include telescopic shooting stars, and there 
is no reason to doubt that an increase of optical power beyond that 
employed in these observations would reveal still larger numbers of 
these small bodies. In each volume of the size of the earth, of the 
space which the earth is traversing in its orbit about the sun, there 
are as many as thirteen thousand small bodies, each of which is
such as would furnish a shooting star visible under favorable 
circumstances to the naked eye. 

These conclusions are certainly of a startling character, but not of 
greater interest than those relating to the velocity of meteoroids. 
There are two velocities to be considered, which are evidently connected,
the velocity relative to the earth, and the velocity of the
meteoroids in the solar system. To the latter, great interest attaches
from the fact that it determines the nature of the orbit of the
meteoroid. A velocity equal to that of the earth, indicates an orbit
like that of the earth; a velocity √2 times as great, a parabolic
orbit like that of most comets, while a velocity greater than this
indicates a hyperbolic orbit. 
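
The dependence of the orbit on the velocity can be illustrated by a minimal sketch. It rests on assumptions not made in the text: the earth's orbit is treated as circular, the mass of the meteoroid is neglected, and units are chosen so that the earth's distance from the sun, its speed, and its period are all 1. Under these assumptions the vis-viva relation reads ratio² = 2 − 1/a, where a is the semi-major axis, and Kepler's third law reads a³ = P². The function names are illustrative.

```python
import math


def classify_orbit(speed_ratio, tol=1e-9):
    """Classify a heliocentric orbit from the ratio of the body's speed
    to the earth's speed, both taken at the earth's distance from the sun.

    In the assumed units the vis-viva relation is ratio**2 = 2 - 1/a,
    so the sign of 1/a decides the kind of orbit."""
    inverse_a = 2.0 - speed_ratio ** 2
    if abs(inverse_a) <= tol:
        return "parabolic"
    return "elliptic" if inverse_a > 0 else "hyperbolic"


def speed_ratio_from_period(period_years):
    """Speed at the earth's distance, relative to the earth's speed, for
    an orbit of the given period, using Kepler's third law a**3 = P**2."""
    a = period_years ** (2.0 / 3.0)
    return math.sqrt(2.0 - 1.0 / a)


if __name__ == "__main__":
    for ratio in (1.0, math.sqrt(2.0), 1.5):
        print(f"ratio {ratio:.3f}: {classify_orbit(ratio)}")
    print("ratio for a 33.25-year period:", speed_ratio_from_period(33.25))
```

For a period of 33.25 years the ratio comes out close to 1.38, a little short of the parabolic value √2, which is why a long elliptic orbit is hard to distinguish from a parabola by the velocity alone.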

Professor Newton sought to form an estimate of this critical 
quantity in more than one way. That on which he placed most 
reliance was based on a comparison of the numbers of shooting stars 
seen in the different hours of the night. It is evident that in the 
morning, when we are in front of the earth in its motion about the 
sun, we should see more shooting stars than in the evening, when we 
are behind the earth; but the greater the velocity of the meteoroids 
compared with that of the earth, the less the difference would be in 
the numbers of evening and morning stars.* 


* It may not be out of place to notice here an erratum which occurs both in the
Memoirs of the National Academy and in the abstract in the American Journal of
Science, and which the writer finds marked in a private copy of Professor Newton’s.
In the table on p. 20 of the memoir and 206 of the abstract, the column of numbers
under the head “hour of the night” should be inverted. There is another displacement
in the table in the memoir, which is, however, corrected in the abstract.





After a careful discussion of the evidence Professor Newton reached 
the conclusion that “ we must regard as almost certain (on the 
hypothesis of an equable distribution of the directions of absolute 
motions), that the mean velocity of the meteoroids exceeds considerably
that of the earth; that the orbits are not approximately
circular, but resemble more the orbits of comets.”

This last sentence, which is taken from the abstract published in 
the American Journal of Science in 1865, and is a little more 
definitely and positively expressed than the corresponding passage 
in the original memoir, indicating apparently that the author's
conviction had been growing more positive in the interval, or at
least that the importance of the conclusion had been growing upon
him, embodies what is perhaps the most important result of the
memoir, and derives a curious significance from the discoveries which 
were to astonish astronomers in the immediate future. 

The return of the November or Leonid shower in 1865, and 
especially in 1866, when the display was very brilliant in Europe, 
gave an immense stimulus to meteoric study, and an especial 
prominence to this group of meteoroids. “ Not since the year 
1759,” says Schiaparelli, “ when the predicted return of a comet first 
took place, had the verified prediction of a periodic phenomenon 
made a greater impression than the magnificent spectacle of
November, 1866. The study of cosmic meteors thereby gained the 
dignity of a science, and took finally an honorable place among 
the other branches of astronomy.” * Professor J. C. Adams, of 
Cambridge, England, then took up the calculation of the perturbations
determining the motion of the node. We have seen that
Professor Newton had shown that the periodic time was limited to 
five sharply determined values, each of which with the other data 
would give an orbit, and that the true orbit could be distinguished 
from the other four by the calculation of the secular motion of the 
node. 

Professor Adams first calculated the motion of the node due to the 
attractions of Jupiter, Venus, and the Earth for the orbit having a 
period of 354.6 days. This amounted to a little less than 12' in
33.25 years. As Professor Newton had shown that the dates of the
showers require a motion of 29' in 33.25 years, the period of 354.6
days must be rejected. The case would be nearly the same with a
period of 376.6 days, while a period of 180 or 185.4 days would give
a still smaller motion of the node. Hence, of the five possible periods 
indicated by Professor Newton, four were shown to be entirely 
incompatible with the motion of the node, and it only remained to 
examine whether the fifth period, viz. that of 33.25 years, would give
a motion of the node in accordance with the observed value. As 
this period gives a very long ellipse for the orbit, extending a little 
beyond the orbit of Uranus, it was necessary to take account of the 
perturbations due to that planet and to Saturn. Professor Adams 
found 28' for the motion of the node. As this value must be regarded 
as sensibly identical with Professor Newton’s 29' of observed motion, 
no doubt was left in regard to the period of revolution or the orbit of 
the meteoroids.* 

About this time, M. Schiaparelli was led by a course of reasoning 
similar to Professor Newton’s to the same conclusion, that the mean
velocity of the meteoroids is not very different from that due to 
parabolic orbits. In the course of his speculations in regard to the 
manner in which such bodies might enter the solar system, the 
questions suggested themselves: whether meteoroids and comets may 
not have a similar origin ; whether, in case a swarm of meteoroids 
should include a body of sufficient size, this would not appear as a 
comet; and whether some of the known comets may not belong to 
streams of meteoroids. Calculating the orbit of the Perseids, or 
August meteoroids, from the radiant point, with the assumption of a 
nearly parabolic velocity, he found an orbit very similar to that of 
the great comet of 1862, whicli may therefore be considered as one 
of the Perseids,—probably the largest of them all.t 

At that time no known cometic orbit agreed with that of the 
Leonids, but a few months later, as soon as the definitive elements of 
the orbit of the first comet of 1866 were published, their resemblance 
to those of the Leonids, as calculated for the period of 33.25 years,
which had been proved to be the correct value, was strikingly
manifested, attracting at once the notice of several astronomers. 

Other relations of the same kind have been discovered later, of 
which that of Biela’s comet and the Andromeds is the most interesting, 
as we have seen the comet breaking up under the influence of the 
sun; but in no case is the coincidence so striking as in that of 
the Leonids, since in no other case is the orbit of the meteoroids 
completely known, independently of that of the comet, and without 
any arbitrary assumption in regard to their periodic time. 

* Monthly Notices Roy. Ast. Soc., vol. xxvii, p. 247.

The first comet of 1866 is probably not the only one belonging to
the Leonid stream of meteoroids. Professor Newton has remarked
that the Chinese annals mention two comets which passed rapidly in
succession across the sky in 1366, a few days after the passage of the
earth through the node of the Leonid stream, which was marked in
Europe by one of the most remarkable star-showers on record. The
course of these comets, as described by the annalists, was in the line
of the Leonid stream.*

This identification of comets with meteors or shooting-stars marks
an epoch in the study of the latter. Henceforth, they must be
studied in connection with comets. It was presumably this discovery
which led Professor Newton to those statistical investigations
respecting comets, which we shall presently consider. At this point,
however, at the close as it were of the first chapter in the history of
meteoric science, it seems not unfitting to quote the words of an
eminent foreign astronomer, written about this time, in regard to
Professor Newton’s contributions to this subject. In an elaborate
memoir in the Comptes Rendus, M. Faye says, with reference to our
knowledge of shooting-stars and their orbits, “we may find in the
works of M. Newton, of the United States, the most advanced
expression of the state of science on this subject, and even the germ,
I think, of the very remarkable ideas brought forward in these last
days by M. Schiaparelli and M. Le Verrier.”†

The first fruit of Professor Newton’s statistical studies on comets
appeared in 1878 in a paper “On the Origin of Comets.” In this
paper he considers the distribution in the solar system of the known
cometic orbits, and compares it with what we might expect on either
of two hypotheses: that of Kant, that the comets were formed in the
evolution of the solar system from the more distant portion of the
solar nebula; and that of Laplace, that the comets have come from the
stellar spaces and in their origin had no relation to the solar system.

In regard to the distribution of the aphelia, he shows that, except
so far as modified by the perturbations due to the planets, the theory
of internal origin would require all the aphelia to be in the vicinity
of the ecliptic; the theory of external origin would make all
directions of the aphelia equally probable, i.e., the distribution in latitude
of the aphelia should be that in which the frequency is as the cosine
of the latitude. The actual distribution comes very near to this, but
as the effect of perturbations would tend to equalize the distribution
of aphelia in all directions, Professor Newton does not regard this
argument as entirely decisive. He remarks, however, that if Kant’s
hypothesis be true, the comets must have been revolving in their
orbits a very long time, and the process of the disintegration of comets
must be very slow.
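The statement that equally probable directions give a frequency proportional to the cosine of the latitude may be made concrete by a short sketch. It is an illustration only, not Professor Newton's own computation; the function names are hypothetical. For directions uniformly distributed over the sphere, the density per radian of latitude b is cos(b)/2, and the fraction of directions lying within a given angle B of the ecliptic is sin(B).

```python
import math


def latitude_density(lat_deg):
    """Density per radian of latitude for isotropic directions: cos(b) / 2."""
    return 0.5 * math.cos(math.radians(lat_deg))


def fraction_within(lat_limit_deg):
    """Fraction of isotropic directions with |latitude| <= limit: sin(B)."""
    return math.sin(math.radians(lat_limit_deg))


if __name__ == "__main__":
    for limit in (10, 30, 60, 90):
        print(f"|latitude| <= {limit:2d} deg: {fraction_within(limit):.3f}")
```

On this hypothesis one half of all aphelia lie within 30° of the ecliptic, so that a moderate crowding toward low latitudes is by itself no evidence of an internal origin.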

* Amer. Jour. Sci., ser. 2, vol. xliii, p. 2t)8, and vol. xlv, p. 91, or Encycl. Britann.,
article Meteor.

† Comptes Rendus, t. lxiv, p. 551.

In regard to the distribution of the orbits in inclination, the author
shows that the theory of internal origin would make all inclinations
equally probable; the theory of external origin would make all
directions of the normal to the plane of the orbit equally probable.
On the first hypothesis, therefore, we should expect a uniform 
distribution in inclination; on the second, a frequency proportioned 
to the sine of the inclination. It was shown by a diagram in which 
the actual and the two theoretical distributions are represented
graphically, that the actual distribution agrees pretty well with 
the theory of external origin and not at all with that of internal 
origin. It was also shown that the curve of actual distribution 
cannot be made to agree with Kant’s hypothesis by any simple 
and reasonable allowances for perturbations. On the other hand, 
if we assume the external origin of comets, and ask how the curve 
of sines must be modified in order to take account of perturbations, 
it is shown that the principal effect will be to increase somewhat 
the number of inclinations between 90° and 135° at the expense of 
those between 45° and 90°. It is apparent at once from the diagram
that such a change would make a very good agreement between the
actual and theoretical curves, the only important difference remaining
being due to comets of short periods, which mostly have small 
inclinations with direct motion. These should not weigh very much, 
the author observes, in the general question of the distribution of 
inclinations, because they return so frequently and are so easily 
detected that their number in a list of observed comets is out of all 
proportion to their number among existing comets. But this group 
of comets of short periods can easily be explained on the theory of an 
external origin. For such comets must have lost a large part of 
their velocity by the influence of a planet. This is only likely to 
happen when a comet overtakes the planet and passes in front of it. 
This implies that its original motion was direct and in an orbit of 
small inclination to that of the planets, and although it may lose a 
large part of its velocity, its motion will generally remain direct and 
in a plane of small inclination. This very interesting case of the 
comets of short periods and small inclinations, which was treated 
rather briefly in this paper, was discussed more fully by Professor 
Newton at the meeting of the British Association in the following 
year.* 
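The two theoretical distributions in inclination compared above may be sketched numerically. This is an illustration under the two idealized hypotheses only, without any allowance for perturbations, and the function names are hypothetical. With inclinations reckoned from 0° to 180°, a uniform distribution gives the fraction (b - a)/180 between inclinations a and b, while equally probable directions of the normal give the fraction (cos a - cos b)/2, which corresponds to a frequency proportional to the sine of the inclination.

```python
import math


def fraction_uniform(a_deg, b_deg):
    """Fraction of orbits with inclination in [a, b] if all inclinations
    from 0 to 180 degrees are equally probable."""
    return (b_deg - a_deg) / 180.0


def fraction_sine_law(a_deg, b_deg):
    """Fraction of orbits with inclination in [a, b] if all directions of
    the orbit normal are equally probable (density sin(i) / 2)."""
    a, b = math.radians(a_deg), math.radians(b_deg)
    return (math.cos(a) - math.cos(b)) / 2.0


if __name__ == "__main__":
    edges = [0, 45, 90, 135, 180]
    for a, b in zip(edges[:-1], edges[1:]):
        print(f"{a:3d}-{b:3d} deg: uniform {fraction_uniform(a, b):.3f}, "
              f"sine law {fraction_sine_law(a, b):.3f}")
```

The sine law places far fewer orbits near 0° and 180° than the uniform law does, which is why the comets of short period, with their small inclinations, stand out as the principal remaining discrepancy.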

Many years later, Professor Newton returned to the same general
subject in a very interesting memoir “On the Capture of Comets by
Planets; especially their Capture by Jupiter,” which was read before
the National Academy in 1891, and appeared in the Memoirs of the
Academy two years later.† It also appeared in the American
Journal of Science in the year in which it was read.‡ This contains
the results of careful statistical calculations on the effect of
perturbations on orbits of comets originally parabolic. It corroborates
the more general statements of the paper “On the Origin of Comets,”
giving them a precise quantitative form. One or two quotations
will give some idea of the nature of this very elaborate and curious
memoir, in which, however, the results are largely presented in the
form of diagrams.

* Brit. Assoc. Adv. Sci. for 1879, p. 272.

† Mem. Nat. Acad., vol. vi, 1st memoir.

‡ Amer. Jour. Sci., ser. 3, vol. xlii, pp. 183 and 482.

On a certain hypothesis regarding an original equable distribution
of comets in parabolic orbits about the sun, it is shown that “if in
a given period of time a thousand million comets come in parabolic
orbits nearer to the sun than Jupiter, 126 of them will have their
orbits changed” by the action of that planet “into ellipses with
periodic times less than one-half that of Jupiter; 839 of them will
have their orbits changed into ellipses with periodic times less than
that of Jupiter; 1701 of them will have their orbits changed into
ellipses with periodic times less than once and a half that of Jupiter,
and 2670 of them will have their orbits changed into ellipses with
periodic times less than twice that of Jupiter.” A little later, Professor
Newton considers the question, which he characterizes as
perhaps more important, of the direct or retrograde motion of the
comets after such perturbations. It is shown that of the 839 comets
which have periodic times less than Jupiter, 203 will have retrograde
motions, and 636 will have direct motions. Of the 203 with retrograde
motion, and of the 636 with direct motion, 51 and 267, respectively,
will have orbits inclined less than 30° to that of Jupiter.
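The proportions implied by these figures are easily worked out. The sketch below is only an arithmetical aid, with hypothetical variable names; it takes 839 (that is, 203 + 636) as the number of comets brought to periods shorter than Jupiter's, and uses the low-inclination counts exactly as quoted.

```python
TOTAL_COMETS = 1_000_000_000  # "a thousand million" parabolic comets

# Cumulative numbers captured into ellipses with periodic times less than
# the given multiple of Jupiter's period.
captured_below = {0.5: 126, 1.0: 839, 1.5: 1701, 2.0: 2670}

# Breakdown of the comets with periods shorter than Jupiter's.
retrograde, direct = 203, 636
low_incl_retrograde, low_incl_direct = 51, 267


def share(part, whole):
    """Simple proportion of part to whole."""
    return part / whole


if __name__ == "__main__":
    for multiple, count in captured_below.items():
        print(f"period < {multiple} x Jupiter: "
              f"{share(count, TOTAL_COMETS):.3e} of all comets")
    print(f"direct share: {share(direct, retrograde + direct):.3f}")
    print(f"low inclination among direct: {share(low_incl_direct, direct):.3f}")
    print(f"low inclination among retrograde: "
          f"{share(low_incl_retrograde, retrograde):.3f}")
```

Capture is thus exceedingly rare, fewer than one comet in a million being brought to a period shorter than Jupiter's, and the captured comets are predominantly direct.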

We have seen that the earliest of Professor Newton’s more important 
studies on meteors related to the Leonids, which at that time far 
surpassed all other meteoric streams in interest. One of his later 
studies related to another stream which in the mean time had acquired 
great importance. The identification of the orbit of the Andromed 
meteors with that of Biela’s comet, which we have already mentioned, 
gave these bodies a unique interest, as the comet had been seen to 
break up under the influence of the sun. Here the evolution of
meteoroids was taking place before our eyes; and this interest was 
heightened by the showers of 1872 and 1885, which in Europe seem 
to have been unsurpassed in brilliancy by any which have occurred 
in this century. 

The phenomena of each of these showers were carefully discussed 
by Professor Newton. Among the principal results of his paper on 
the latter shower are the following:*

The time of the maximum frequency of meteors was Nov. 27, 1885,
6 h. 16 m. Gr. m. t. The estimated number per hour visible at one
place was then 76,000. This gives a density of the meteoroids in
space represented by one to a cube of twenty miles edge. Three 
hours later the frequency had fallen to one-tenth of the maximum 
value. The really dense portion of the stream through which we 
passed was less than 100,000 miles in thickness, and nearly all would 
be included in a thickness of 200,000 miles. 
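The density quoted may be put in more familiar terms by a short calculation, given here only as a rough illustration with hypothetical function names. It assumes, for simplicity, that the maximum density of one meteoroid to a cube of twenty miles edge holds uniformly through the thickness considered, which overstates the numbers away from the densest part of the stream.

```python
CUBE_EDGE_MILES = 20.0  # one meteoroid per cube of this edge, at maximum


def number_density_per_cubic_mile(edge_miles=CUBE_EDGE_MILES):
    """Meteoroids per cubic mile, for one meteoroid per cube of given edge."""
    return 1.0 / edge_miles ** 3


def count_in_column(thickness_miles, cross_section_sq_miles=1.0,
                    edge_miles=CUBE_EDGE_MILES):
    """Meteoroids in a column of the given thickness and cross-section,
    ASSUMING the maximum density holds throughout (an upper estimate)."""
    density = number_density_per_cubic_mile(edge_miles)
    return density * thickness_miles * cross_section_sq_miles


if __name__ == "__main__":
    print("per cubic mile:", number_density_per_cubic_mile())
    print("in a 1 sq. mile column, 100,000 miles thick:",
          count_in_column(100_000))
```

One meteoroid to 8000 cubic miles is a very thin distribution, and even a column one mile square drawn through the whole dense portion of the stream would contain only a dozen or so bodies.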

A formula is given to express the effect of the earth’s attraction
on the approaching meteoroids in altering the position of the radiant. 
This is technically known as the zenithal attraction, and is quite 
important in the case of these meteors on account of their small 
relative velocity. The significance of the formula may be roughly 
expressed by saying that the earth’s attraction changes the radiant of 
the Biela meteors, toward the vertical of the observer, one-tenth of 
the observed zenith distance of the radiant, or more briefly, that the
zenithal attraction for these meteors is one-tenth of the observed 
zenith distance. The radiant even after the correction for zenithal 
attraction, and another for the rotation of the earth on its axis, is not 
a point but an area of several degrees diameter. The same has been 
observed in regard to other showers, but the result comes out more 
distinctly in the present case because the meteors were so numerous 
and the shower so carefully observed. 
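The rough rule just stated can be written as a one-line correction. The sketch below encodes only that rule of thumb, not Professor Newton's formula, which is not reproduced here; the function names are hypothetical, and it is assumed that the correction is applied by adding the zenithal attraction to the observed zenith distance, since the attraction displaces the radiant toward the zenith.

```python
def zenithal_attraction_deg(observed_zenith_distance_deg):
    """Rule of thumb for the Biela meteors: one-tenth of the observed
    zenith distance of the radiant."""
    return observed_zenith_distance_deg / 10.0


def corrected_zenith_distance_deg(observed_zenith_distance_deg):
    """Zenith distance of the radiant with the earth's attraction removed,
    ASSUMING the correction is added to the observed value."""
    return (observed_zenith_distance_deg
            + zenithal_attraction_deg(observed_zenith_distance_deg))


if __name__ == "__main__":
    for z in (0.0, 30.0, 60.0):
        print(f"observed {z:4.1f} deg -> "
              f"corrected {corrected_zenith_distance_deg(z):4.1f} deg")
```

The correction vanishes for a radiant in the zenith and grows with its distance from the zenith, so that observations made at different hours of the same night require different corrections.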

This implies a want of parallelism in the paths of the meteors, and 
it is a very important question whether it exists before the meteoroids 
enter our atmosphere, or whether it is due to the action of the 
atmosphere. 

Professor Newton shows that it is difficult to account for so large
a difference in the original motions of the meteoroids, and thinks it 
reasonable to attribute a large part of the want of parallelism to the 
action of the atmosphere on bodies of an irregular form, such as we 
have every reason to believe that the meteoroids have, when they 
enter our atmosphere. The effect of the heat generated will be to 
round off the edges and prominent parts, and to reduce the meteor 
to a form more and more spherical. It is, therefore, quite natural 
that the greater portion of the curvature of the paths should be 
in the invisible portion and thus escape our notice. It is only in 
exceptional cases that the visible path is notably curved. 

But the great interest of the paper centers in his discussion of the 
relation of this shower to preceding showers, and to the orbit of 
Biela’s comet. The changes in the date of the shower (from Dec. 6
to Nov. 27) and in the position of the radiant are shown to be related
to the great perturbations of Biela’s comet in 1794, 1831, and 1841-2.
The showers observed by Brandes, Dec. 6th, 1798, by Herrick, Dec. 7th,
1838, and by Heis, Dec. 8th and 10th, 1847, are related to the orbit
of Biela’s comet as it was in 1772; while the great showers of 1872
and 1885, as well as a trifling display in 1867, are related to the
orbit of 1852.*

Assuming, then, that the meteoroids which we met on the 27th of 
November, 1872, did not leave the immediate neighborhood of the 
Biela comet before 1841-2, we seem to have the data for a very 
precise determination of their orbit between those dates. The same 
is true of those which we met in 1885. The computation of these 
orbits, the author remarks, may possibly give evidence for or against 
the existence of a resisting medium in the solar system. 

In his last public utterance on the subject of meteors, which was
on the occasion of the recent sesquicentennial celebration of the
American Philosophical Society, Professor Newton returns to the Biela
meteoroids, and finds in the scattering which they show in the plane
of their orbit the proof of a disturbing force in that plane, and therefore
not due to the planets. The force exerted by the sun appears to
be modified somewhat as we see it in the comet’s tails, where indeed
the attraction is changed into a repulsion. Something of the same
sort on a smaller scale relatively to the mass of the bodies appears to
modify the sun’s action on the meteoroids.

In 1888 Professor Newton read a paper before the National 
Academy “ Upon the relation which the former Orbits of those 
Meteorites that are in our collections, and that were seen to fall, 
had to the Earth’s Orbit.” This was based upon a very careful study 
of more than 116 cases for which we have statements indicating more 
or less definitely the direction of the path through the air, as well as
94 cases in which we only know the time of the fall. The results are 
expressed in the following three propositions: 

1. The meteorites which we have in our cabinets and which were 
seen to fall were originally (as a class, and with a very small 
number of exceptions) moving about the sun in orbits that had 
inclinations less than 90°; that is, their motions were direct, not
retrograde.

2. The reason why we have only this class of stones in our collections
is not one wholly or even mainly dependent on the habits of
men; nor on the times when men are out of doors; nor on the places
where men live; nor on any other principle of selection acting at or
after the arrival of the stones at the ground. Either the stones
which are moving in the solar system across the earth’s orbit move
in general in direct orbits; or else for some reason the stones which
move in retrograde orbits do not in general come through the air to
the ground in solid form.

3. The perihelion distances of nearly all the orbits in which these
stones moved were not less than 0.5 nor more than 1.0, the earth’s
radius vector being unity.

* It is a curious coincidence that the original discoverer of the December shower as
a periodic phenomenon, Mr. Edward C. Herrick, should have been (with a companion,
Mr. Francis Bradley) the first to observe that breaking up of the parent body which
was destined to reinforce the meteoric stream in so remarkable a manner. See Amer.
Jour. Sci., ser. 3, vol. xxxi, pp. 85 and 88.

Professor Newton adds, that it seems a natural and proper corollary
to these propositions (unless it shall appear that stones meeting the
earth are destroyed in the air) that the larger meteorites moving in
our solar system are allied much more closely with the group of
comets of short period than with comets whose orbits are nearly
parabolic. All the known comets of shorter periods than 33 years
move about the sun in direct orbits that have moderate inclinations
to the ecliptic. On the contrary, of the nearly parabolic orbits that
are known only a small proportion of the whole number have small
inclinations with direct motion.

We have briefly mentioned those papers which seem to constitute
the most important contributions to the science of meteors and comets.
To fully appreciate Professor Newton’s activity in this field, it would
be necessary to take account of his minor contributions.*

Most interesting and instructive to the general reader are his
utterances on occasions when he has given a résumé of our knowledge
on these subjects or some branch of them, as in the address “On the
Meteorites, the Meteors, and the Shooting Stars,” which he delivered
in 1886 as retiring president of the American Association for the
Advancement of Science, or in certain lectures in the public courses
of the Sheffield Scientific School of Yale University, entitled “The
Story of Biela’s Comet” (1874), “The relation of Meteorites to
Comets” (1876), “The Worship of Meteorites” (1889), or in the
articles on Meteors in the Encyclopædia Britannica and Johnson’s
Cyclopædia.

If we ask what traits of mind and character are indicated by
these papers, the answer is not difficult. Professor Klein has divided
mathematical minds into three leading classes: the logicians, whose
pleasure and power lies in subtility of definition and dialectic skill;
the geometers, whose power lies in the use of the space-intuitions;
and the formalists, who seek to find an algorithm for every operation.†
Professor Newton evidently belonged to the second of these classes,
and his natural tastes seem to have found an equal gratification
in the development of a system of abstract geometric truths, or
in the investigation of the concrete phenomena of nature as they
exist in space and time.

* These were detailed in a bibliography annexed to this paper in Amer. Jour. Sci.,
ser. 4, vol. iii.

† Lectures on Mathematics (Evanston), p. 2.

But these papers show more than the type of mind of the author; 
they give no uncertain testimony concerning the character of the 
man. In all these papers we see a love of honest work, an aversion 
to shams, a distrust of rash generalizations and speculations based 
on uncertain premises. He was never anxious to add one more 
guess on doubtful matters in the hope of hitting the truth, or what 
might pass as such for a time, but was always willing to take 
infinite pains in the most careful test of every theory. To these 
qualities was joined a modesty which forbade the pushing of his 
own claims, and desired no reputation except the unsought tribute
of competent judges. At the close of his article on meteors in the
Encyclopædia Britannica, which has not the least reference to himself
as a contributor to the science, he remarks that “meteoric science
is a structure built stone by stone by many builders.” We may
add that no one has done more than himself to establish the 
foundations of the science, and that the stones which he has laid 
are not likely to need relaying. 

The value of Professor Newton’s work has been recognized by
learned societies and institutions both at home and abroad. He
received the honorary degree of Doctor of Laws from the University
of Michigan in 1868. He was president of the section of Mathematics
and Astronomy in the American Association for the Advancement
of Science in 1875, and president of the Association in 1885. On
the first occasion he delivered an address entitled “A plea for the
study of pure Mathematics”; on the second the address on Meteorites,
etc., which we have already mentioned. Of the American Mathematical
Society he was vice-president at the time of his death.
In 1888 the J. Lawrence Smith gold medal was awarded to him
by the National Academy for his investigations on the orbits of
meteoroids. We may quote a sentence or two from his reply to
the address of presentation, so characteristic are they of the man
that uttered them: “To discover some new truth in nature,” he
said, “even though it concerns the small things in the world, gives
one of the purest pleasures in human experience. It gives joy to
tell others of the treasure found.”

Besides the various learned societies in our own country of which 
he was a member, including the American Academy of Arts and 
Sciences from 1862, the National Academy of Sciences from its 
foundation in 1863, the American Philosophical Society from 1867, 
he was elected in 1872 Associate of the Royal Astronomical Society 
of London, in 1886 Foreign Fellow of the Royal Society of Edinburgh, 
and in 1892 Foreign Member of the Royal Society of London. 

But the studies which have won for their author an honorable
reputation among men of science of all countries, form only one 
side of the life of the man whom we are considering. Another 
side, probably the most important, is that in which he was identified 
with the organic life of the College and University with which 
he had been connected from a very early age. In fact, we might 
almost call the studies which we have been considering, the recreations 
of a busy life of one whose serious occupation has been that of an 
instructor. If from all those who have come under his instruction 
we should seek to learn their personal recollections of Professor 
Newton, we should probably find that the most universal impression 
made on his students was his enthusiastic love of the subject which 
he was teaching. 

A department of the University in which he took an especial 
interest was the Observatory. This was placed under his direction 
at its organization, and although he subsequently resigned the 
nominal directorship, the institution remained virtually under his 
charge, and may be said to owe its existence in large measure 
to his untiring efforts and personal sacrifice in its behalf. 

One sphere of activity in the Observatory was suggested by a 
happy accident which Professor Newton has described in the American 
Journal of Science, September, 1893. An amateur astronomer in 
a neighboring town, Mr. John Lewis, accidentally obtained on a 
stellar photograph the track of a large meteor. He announced in 
the newspapers that he had secured such a photograph, and requested 
observations from those who had seen its flight. The photographic 
plate and the letters received from various observers were placed in 
Professor Newton’s hands, and were discussed in the paper mentioned. 
The advantages of photographic observations were so conspicuous 
that Professor Newton was anxious that the Observatory should 
employ this method of securing the tracks of meteors. With the 
aid of an appropriation granted by the National Academy from the 
income of the J. Lawrence Smith fund, a battery of cameras was 
mounted on an equatorial axis. By this means, a number of meteor- 
tracks have been obtained of the August meteors, and in one case, 
through a simultaneous observation by Mr. Lewis in Ansonia, Professor
Newton was able to calculate the course of the meteor in the atmosphere
with a probable error which he estimated at less than a mile.
The results which may be expected at the now near return of the 
Leonids, will be of especial interest, but it will be for others to utilize 
them. 

Professor Newton was much interested in the collection of meteorites, 
and the fine collection of stones and irons in the Peabody Museum 
of Yale University owes much to his efforts in this direction.

Professor Newton was a member of the American Metrological
Society from the first, and was conspicuously active in the agitation 
which resulted in the enactment of the law of 1866, legalizing the 
use of the metric system. He prepared the table of the metric 
equivalents of the customary units of weights and measures which 
was incorporated in the act, and by which the relations of 
the fundamental units were defined. But he did not stop here. 
Appreciating the weakness of legislative enactment compared with 
popular sentiment, and feeling that the real battle was to be won 
in familiarizing the people with the metric system, he took pains to 
interest the makers of scales and rulers and other devices for measurement
in adopting the units and graduations of the metric system,
and to have the proper tables introduced into school arithmetics. 

He was also an active member of the Connecticut Academy of
Arts and Sciences, serving several years both as secretary and
president, and also as member of the council. He was associate editor
of the American Journal of Science from 1864, having especial
charge of the department of astronomy. His notes on observations 
of meteors and on the progress of meteoric science, often very brief, 
sometimes more extended, but always well considered, were especially 
valuable. 

In spite of his studious tastes and love of a quiet life, he did not 
shirk the duties of citizenship, serving a term as alderman in the 
city council, being elected, we may observe, in a ward of politics 
strongly opposed to his own. 

Professor Newton married, April 14th, 1859, Anna C., daughter 
of the Rev. Joseph C. Stiles, D.D., of Georgia, at one time pastor 
of the Mercer Street Presbyterian Church in New York City, and
subsequently of the South Church in New Haven. She survived 
her husband but three months, leaving two daughters. 

In all these relations of life, the subject of this sketch exhibited 
the same traits of character which are seen in his published papers, 
the same modesty, the same conscientiousness, the same devotion 
to high ideals. His life was the quiet life of the scholar, ennobled 
by the unselfish aims of the Christian gentleman; his memory will 
be cherished by many friends; and so long as astronomers, while 
they watch the return of the Leonids marking off the passage of 
the centuries, shall care to turn the earlier pages of this branch 
of astronomy, his name will have an honorable place in the history of 
the science. 